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Abstract 



It is believed that strong ferromagnetic interactions in some solids are generated by 

subtle interplay between quantum many-body effects and spin-independent Coulomb 

interactions between electrons. It is a challenging problem to verify this scenario in 

the Hubbard model, which is an idealized model for strongly interacting electrons in a 

^ ■ sohd. 

Nagaoka's ferromagnetism is a well-known rigorous example of ferromagnetism in 

'^ I the Hubbard model. It deals with the limiting situation in which there is one fewer 

C ' electron than in the half- filling and the on-site Coulomb interaction is infinitely large. 

There are relatively new rigorous examples of ferromagnetism in Hubbard models called 

flat-band ferromagnetism. Flat-band ferromagnetism takes place in carefully prepared 

models in which the lowest bands (in the single-electron spectra) are "flat." Usually, 

^ ■ these two approaches are regarded as two complimentary routes to ferromagnetism in 

^^ , the Hubbard model. 

p^ I In the present paper we describe Nagaoka's ferromagnetism and flat-band ferro- 

CN ■ magnetism in detail, giving all the necessary background as well as complete (but 

elementary) mathematical proofs. By studying an intermediate model called the long- 
Qv^ I range hopping model, we also demonstrate that there is indeed a deep relation between 

these two seemingly different approaches to ferromagnetism. 

We further discuss some attempts to go beyond these approaches. We briefly discuss 

recent rigorous example of ferromagnetism in the Hubbard model which has neither 

'^ ' infinitely large parameters nor completely flat bands. We give preliminary discussion 

^ ■ regarding possible experimental realizations of the (nearly-)flat-band ferromagnetism. 

Q , Finally, we focus on some theoretical attempts to understand metallic ferromagnetism. 

We discuss three artificial one-dimensional models in which the existence of metallic 

ferromagnetism can be easily proved. 

We have tried to make the present paper as self-contained as possible, keeping in 
a I mind readers who are new to the field. Although the present paper is written as a 

review, it contains some material which appears for the first time. 



^Invited paper; Prog. Thoer. Phys. 99 (1998). Archived as cond-mat/9712219. 
^hal. tasaki@gakushuin.ac.jp, http://www.gakushuin.ac.jp/~881791/ 
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1 Introduction 

1.1 Hubbard model and the origin of ferromagnetism 

The origin of strong ferromagnetic ordering observed in some materials has been a mystery 
in physical science for a long time. Since quantum mechanical many-electron systems with- 
out interactions universally exhibit paramagnetism, the origin of ferromagnetism should be 
sought in electron-electron interactions. In most solids, however, the dominant part of the in- 
teraction between electrons is the Coulomb interaction, which is perfectly spin-independent. 
We are thus faced with a very interesting and fundamental problem in theoretical physics 
to determine whether spin-independent interactions in an itinerant electron system can be 
the origin of ferromagnetic ordering. This problem is important not only because ferromag- 
netism is a very common (and useful) phenomenon, but because it focuses on the fundamental 
role of nonlinear interactions in many-body quantum mechanical systems. 

It was Heisenberg [1] who first realized that ferromagnetism is an intrinsically quantum 
mechanical phenomenon. In Heisenberg's approach to ferromagnetism, one starts from the 
picture that each electron (relevant to magnetism) is almost localized in an atomic orbit. 
By treating the effect of the Coulomb interaction and overlap between nearby atomic orbits 
in a perturbative manner as in the Heitler-London theory, Heisenberg concluded that there 
appears an "exchange interaction" between nearby electronic spins which determines the 
magnetic properties of the system. 

In a different approach to the problem of ferromagnetism, which was originated by Bloch 
[2], one starts from the quantum mechanical free electron gas, in which electrons are in 
plane-wave like states. One then treats the effect of the Coulomb interaction perturbatively, 
and tries to find instabilities against certain magnetic ordering. When combined with the 
Hartree-Fock approximation (or a mean-field theory), this approach leads to the picture that 
there is an instability with respect to ferromagnetism when the density of states at the fermi 
energy and the Coulomb interaction are sufficiently large. 

In spite of a considerable number of attempts to improve these ideas, neither of these 
two approaches has yet produced a truly convincing explanation about the origin of ferro- 
magnetism. 

A modern version of the problem of the origin of ferromagnetism was formulated by 
Kanamori [3], Gutzwiller [4], and Hubbard [5] in the 1960's^. They studied simple tight- 
binding models of electrons with on-site Coulomb interaction. This model is usually called 
the 'Hubbard model.' When there is no electron-electron interaction, the model exhibits 
paramagnetism as an inevitable consequence of the Pauli exclusion principle. Among other 
things, Kanamori, Gutzwiller, and Hubbard asked whether the paramagnetism found for a 
non-interacting system can be converted into ferromagnetism when there is a sufficiently 



^A similar formulation was given earlier, for example, in [6]. 



large Coulomb interaction. This is a concrete formulation of the fundamental problem that 
we alluded to in the opening of the previous subsection. 

It is worth noting that the on-site Coulomb interaction itself is completely independent 
of electronic spins, and it does not favor any magnetic ordering. One does not find any 
terms in the Hubbard Hamiltonian which explicitly favor ferromagnetism (or any other 
ordering). Our theoretical goal is to show that magnetic ordering arises as a consequence of 
the subtle interplay between the kinetic motion of electrons and the short-ranged Coulomb 
interaction. It is interesting to compare this situation with that in spin systems, where one 
is often given a Hamiltonian which favors some kind of magnetic ordering, and the major 
task is to investigate if such ordering really takes place. We can say that the Hubbard 
model formulation goes deeper into fundamental mechanisms of magnetism than that of 
spin systems. It offers a challenging problem to theoretical physicists to derive magnetic 
interaction from models which do not explicitly contain such interactions. 

From a more global point of view, the importance of the Hubbard model may be under- 
stood from the philosophy of "universality", which, in our opinion, is at the very heart of 
contemporary physics. We believe that nontrivial physical phenomena or mechanisms found 
in a suitable idealized model can also be found in other systems in the same "universality 
class" as the idealized model. We expect that the universality class is often large and rich 
enough to contain various realistic systems with complicated details which are ignored in 
the idealized model. As for strongly interacting electron systems, the Hubbard model is 
regarded as one of the most promising candidates for an idealized model to be used in our 
search of possible universality classes. 

Perhaps the best justification of the Hubbard model as a standard model of itinerant 
electron systems comes from such theoretical considerations, rather than its accuracy in 
modeling narrow band electron systems. 

1.2 Rigorous results 

The problem of ferromagnetism in the Hubbard model was studied extensively using various 
heuristic methods. The Hartree-Fock approximation discussed above leads one to the so 
called Stoner criterion. It states that the Hubbard model exhibits ferromagnetism when the 
condition UDp > 1 is satisfied, where U is the the strength of the on-site Coulomb interaction 
and Dp is the density of states of the corresponding single-electron problem measured at the 
fermi level (of the corresponding non-interacting system). Although the criterion cannot 
be trusted literally^, it guides us to look for ferromagnetism in models in which U is large 
and/or the density of states is large. 

"^There are many systems which fully satisfy the criterion but do not exhibit ferromagnetism. Flat-band 
Hubbard models with low electron densities [7, 8] are typical examples. 



The first rigorous result about ferromagnetism in the Hubbard model, which is one of 
the main topics of the present paper, was provided by Nagaoka^ [10]. It was proved that 
certain Hubbard models have ground states with saturated ferromagnetism when there is 
exactly one hole and the Coulomb repulsion U is infinite. 

In 1989, Lieb proved an important general theorem for the Hubbard model on a bipartite 
lattice at half- filling [11]. As a corollary of this theorem, Lieb showed that a rather general 
class of Hubbard model exhibits ferrimagnetism^. See also [12]. 

In 1991, Mielke [13, 14] found a new class of rigorous examples of ferromagnetism in the 
Hubbard model. He showed that the Hubbard models on a general class of line graphs have 
ferromagnetic ground states. A special feature of Mielke's models is that the corresponding 
single-electron Schrodinger equation has highly degenerate ground states. In other words, 
Mielke's models have fiat (or dispersionless) bands. Mielke' original results were for the case 
in which the number of electrons corresponds to the half-filling of the lowest flat band, but 
later it was extended to different electron densities in two-dimensional models [15]. 

A similar but different class of examples of ferromagnetism in Hubbard models, which 
we will discuss in detail in the present paper, were proposed in [7, 8]. These models are 
defined on a class of decorated lattices with "cell structures" , and are also characterized by 
fiat bands at the bottom of the single-electron spectrum. In a class of models in two and 
higher dimensions, it was proved that the ferromagnetism is stable against fluctuations in 
the electron number [7, 8]. The ferromagnetism in Mielke's models and those in [7, 8] is now 
called "flat-band ferromagnetism", and regarded as one of reliable starting points for the 
problem of ferromagnetism. 

There are also rigorous results for ferromagnetism in 'nearly-flat-band models' obtained 
by perturbing the flat-band models of [7, 8]. In a general situation, local stability of ferromag- 
netism is known [16, 17], and for a special class of models, global stability of ferromagnetism 
has been established [18, 19]. We shall also briefly discuss the latter results in the present 
paper. 

Nagaoka's ferromagnetism takes place in models with U -^ oo, while (nearly-)flat-band 
ferromagnetism takes place in models characterized by large (or infinite) density of states 
Dp. One might say that these two rigorous results realize the Stoner criterion UDp > 1 
through complementary paths. 

1.3 About the present paper 

One of the main aims of the present paper is to give complete descriptions of Nagaoka's ferro- 
magnetism and flat-band ferromagnetism of [7, 8]. In particular, the proof of the generalized 

'"'Thouless [9] reached a similar but slightly weaker conclusion. 

^Ferrimagnetism is a kind of antiferromagnctism on a bipartite lattice such that the numbers of sites in 
two sublattices are different. 



version of Nagaoka's theorem is described in complete detail for the first time. Another 
important aim is to show that there is a close relation between these two approaches which 
are usually regarded as complementary. We demonstrate this fact by studying an artificial 
model which we call the 'long range hopping model.' Our hope is that a clarification of the 
relation between the two approaches will lead us to a more global view of ferromagnetism in 
the Hubbard model, and that this might lead us in the long run to a better understanding 
of the essence of the fascinating phenomenon of ferromagnetism. 

We also discuss three topics related to these results. First, we briefiy discuss important 
attempts to go beyond fiat-band ferromagnetism by treating non-singular Hubbard models. 
One of the main achievements in this direction is a proof of the existence of ferromagnetism 
in a Hubbard model which has a finite Coulomb repulsion U and a finite density of states 
Dp. Next, we give a preliminary discussion of some experimental results which may be 
relevant to fiat-band ferromagnetism. The ferromagnetism seen in La4Ba2Cu20io, which 
was discovered back in 1990, shows some striking similarities with fiat-band ferromagnetism. 
Finally we describe some attempts to obtain theoretical examples of metallic ferromagnetism. 
There are strong indications that some one-dimensional Hubbard models exhibit metallic 
ferromagnetism, but there are no rigorous results yet. We discuss three artificial models in 
one-dimension which are easily shown to exhibit metallic ferromagnetism. The results for 
the third model, the limiting U-V model, are perhaps new. 

We have tried to make the present paper as self-contained as possible^. We carefully start 
from basic assumptions and definition of the Hubbard model, keeping in mind readers who 
are new to the field. We give complete proofs to all the theorems which are directly related 
to the main subjects of the paper. The proofs, however, may not be optimally organized 
from a mathematical point of view. Instead, we have tried to present "readable" proofs from 
which the readers can learn physical mechanisms underlying the theorems. (This comment 
applies even to the proof of purely mathematical theorems such as Theorem 4.3.) 

The present paper is organized as follows. In Section 2, we give a complete definition 
of the Hubbard model. We start from the description of the single-electron problem, and 
proceed by defining fermion operators, many-body Hilbert space, and the Hubbard Hamil- 
tonian. An expert can safely skip this entire section, provided that he/she takes a brief 
look at Sections 2.1 and 2.3 to note our notation in the coordinate-free formalism of fermion 
operators. 

Section 3 is also introductory and standard. We define the spin angular momenta of the 
model, and give a precise definition (Definition 3.1) of what we mean by ferromagnetism. 
We also present some results which rule out the possibility of ferromagnetism under several 
conditions. 



^For related reviews of mathematically rigorous results in the Hubbard model, see [20, 21]. Unfortunately 
there are many interesting related topics that we do not even mention in the present paper. For recent reviews 
of related topics from complementary points of view, see [22, 23, 24, 25]. 



In Section 4, we present a complete description and a proof of Nagaoka's ferromagnetism 
in its most generalized form. 

In Section 5, we introduce and discuss an artificial model that we call the 'long-range 
hopping model.' Ferromagnetism in this model is first regarded as a special case of Nagaoka's 
ferromagnetism, but a different point of view is presented. This new picture naturally leads 
us to flat-band ferromagnetism. 

In Section 6, we introduce flat-band ferromagnetism as a natural generalization of the 
long-range hopping model. We also discuss some results for the nearly-flat-band models 
obtained by adding a perturbation to the flat-band models. 

In Section 7, we focus on experimental results which may be relevant to flat-band ferro- 
magnetism. 

In Section 8, we present some results about the possibility of metallic ferromagnetism in 
the Hubbard model. Preliminary rigorous results for related models in one-dimension are 
discussed. 

Technical material is contained in the Appendices. In Appendices A and B, we discuss 
the gauge transformation and the hole-particle transformation, respectively. In Appendix C, 
we summarize some useful properties of positive semideflnite operators. In Appendix D, 
we present (mainly for mathematicians) an explicit construction of the Hilbert space and 
fermion operators. In Appendix E, we explain (for readers without background in condensed 
matter physics) the elementary notion of band structures for a single-electron in a tight- 
binding description. In Appendix F, we present technical calculations required in the proof 
of Theorem 3.3. 

2 Definition of the Hubbard model 

2.1 Tight-binding description of a single electron 

Before introducing the Hubbard model, we describe the corresponding single-electron prob- 
lem. Let lattice A be a collection of A^s sites. Lattice sites x,y, . . . G A represent atomic sites 
in a solid. In the tight-binding description, which is a kind of low-energy effective theory, 
we declare that electrons can live only on lattice sites. In the single- orbital model that we 
study here, we further assume that each atomic site carries a single non-degenerate® orbital 
state. Of course, actual atoms can have more than one orbit (or band). The philosophy 
behind the model building is that those electrons in other states do not play signiflcant roles 
in determining the low-energy physics in which we are interested, and can be "forgotten" for 
the moment. See Figure 1. 



Hn some solids, the degeneracy in the original atomic orbit is lifted by crystalline anisotropy. 




c) 




d) 



Figure 1: Highly schematic figures which explain the philosophy of tight-binding descriptions, 
a) A single atom which has multiple electrons in different orbits, b) When atoms come 
together to form a solid, electrons in the black orbits become itinerant, while those in the 
light gray orbits are still localized at the original atomic sites. Electrons in the gray orbits are 
mostly localized around the atomic sites, but tunnel to nearby gray orbits with non-negligible 
probabilities, c) We only consider the electrons in the gray orbits, which are expected to 
play essential roles in determining various aspects of low-energy physics of the system, d) If 
the gray orbit is non-degenerate, we get a lattice model in which electrons live on lattice 
sites and hop from one site to another. 



Then, a quantum mechanical state of a single electron (with a fixed spin) is described 
by a vector state ip = (v9^)j,gA with cp^ E C. We denote by [) = C^=(= £2(A;C)) the 
iVg- dimensional Hilbert space formed by such cp. The inner product of two states (p and 

■0 = {i^x)x&A e f) is 

{ip,i^) = j2ivxr^x. (2.1) 

xeA 

For x,y E A with x ^ y, we denote by tx,y the quantum mechanical amplitude that an 
electron hops (i.e., tunnels) from the site y to x. We assume tx^y is reaF and symmetric 
as tx^y = ty^x- For a; G A, we denote by t^^x the (real valued) potential energy^'^ (which we 
usually write Vx) for the electron at site x. Then the Schrodinger equation for stationary 
states becomes 

e({>x = ^ tx,y (Py, for any x E A, (2.2) 

2/6A 

where e is the energy eigenvalue^'^. By introducing the hopping matrix T = {tx^y)x,yeA, which 
is a matrix on [), this can be written in a coordinate free vector form as 

ecp = Tip. (2.3) 

Since T is real symmetric, the eigenvalue equation (2.3) has A^s eigenvalues £i, . . . ,£Ns- 
The corresponding eigenstate^^ V = (w )xgA satisfies 

e^.-0(i) = T^(i\ (2.4) 

for each j = 1, . . . , A^g. We can assume that the energy eigenvalues are ordered as Sj < £j+i, 

and eigenstates are orthonormal in the sense that (il>, ij) ) = Sjjr. 

Let us consider a standard example. We take the one-dimensional lattice A = {1,2,..., A^s}, 
and set tx,x+i = 'tx+i,x = —t for all x E A, and tx,y = otherwise. We impose a periodic 
boundary condition and identify A's + 1 with 1. The eigenvalues and eigenstates can be most 

^In a system under magnetic field, t^^y is generally complex and satisfies {tx,y)* — ty_x- 
^°We do not want to refer to Vx{= tx,x) as "site-dependent chemical potential." The notion of the chemical 

potential makes sense in the contexts of thermodynamics or statistical physics. This notion, in our opinion, 

should be distinguished from the purely quantum mechanical notion of the potential (although there is a 

relation between the two). 

^^It is also common to put a minus sign in front of tx^y in (2.2). We note that the hopping amplitude 

tx,y depends on the delicate overlap between orbital wave functions, and there are no simple principles with 

which we can determine their magnitudes or signs. Also see Appendix A for discussion of the arbitrariness 

of the sign of tx.y 

^^In notation like V'^ , the superscript j is the name of the state and the subscript x is the index for its 

component. 
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naturally indexed as 77^'^-' = {r]x )xeA and €{k), respectively, by the wave number k = 27rn/Ns 
where n = 0, ±1, ±2, . . . , ± {(A^s/2) — 1} , Ns/2 (assuming A'"s is even). Then we have 

^f = 4f e^'^ (2.5) 

and e{k) = —2tcosk. Note that the eigenstates are described by plane waves. 

2.2 Fermion operators and Hilbert space 

We define the Hilbert space for many-electron problems using the fermion operator formal- 
ism^^. For each lattice site x G A and spin index a =^, i, we associate a fermion operator 
Cx,a- One can freely take the conjugate^^, products, and linear combinations (with complex 
coefficients) of these operators (and the identity operator) to get new operators. We require 
that these operators satisfy the anticommutation relations^^ 

{4,a^Cy^r} =Sx,ySa,r, (2.6) 

and 

{4,<x> 4,r} = {Ca;,a, Cy^r} = 0, (2.7) 

for any x,y & A and a, r =1. |, where {A, B} = AB + BA. Note that (2.7) implies {cx aY — 

(4,.)^ = 0. ^ 

Physically, Cx^a and cj. ^ are interpreted as the operators which respectively annihilate 
and create an electron at site x with spin a. The corresponding number operator is defined 

as 

nx,CT = 4,aCx,a- (2.8) 

From (2.7), we find that number operators with different indices commute with each other. 
From (2.6) and (2.7), we see that (n^,^)^ = 4,^c^,^4_„c^,<^ = cl^^{l-cl^^Cx,a)cx,a = 4,<7Cx,<7 = 
Ux^a- Thus we obtain Ux^ai^ — fix^a) = 0, which means that Ux^a can only have eigenvalues 
and 1. This is a mathematical realization of the Pauli exclusion principle. The number 
operator for site x is defined as Hx = Ux^i + Ux^i- 

^•^The formalism is also called "second quantization." One should recall, however, that we are working 
with a many-body problem which is "quantized" only once. The explicit relation to the "first quantization" 
formalism can be found in Appendix D. Readers with background in fields like functional analysis are 
suggested to take a look at this appendix. 

^''We denote the conjugate of an operator Ahy A^ . We write cj, g. instead of (ca;,^-)^. 

"'^^The right-hand side of (2.6) means Sx,ySa,T times the identity operator. Throughout the present paper, 
we refrain from writing the identity operator explicitly. 
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Figure 2: An allowed configuration in a tight-binding model with a single orbit per site. 
Each site in the lattice can be either empty, singly occupied by an electron with up or down 
spin, or doubly occupied by electrons with opposite spins. If we consider the Hubbard type 
interaction, we get an extra energy U > whenever two electrons occupy a single site. (See 
Section 2.5.) The interaction energy for the above configuration is therefore 3f/. 

We can now construct the Hilbert space for many-electron problems. We start from a 
single (vector) state $vac, which physically corresponds to a (fictitious) state with no electrons 
in the lattice. This property is mathematically represented as 

Cx,a^va.c = for any x E A and a =1, |. (2.9) 

We fix a positive integer A'"c such that 1 < iVg < 2A^s) which is the total number of 
electrons in the system. It is useful to express the electron number in terms of the filling 
factor z/ = Nc/{2Ns), which takes a value in the range < z/ < 1. Take iVg arbitrary sites 
xi,X2, ■ ■ ■ , xn^ G a (with possible overlaps) and spin indices (Xi, (72, . . . , (Jn^ =T? i; ^ind define 
the state 

^xi,cri;X2,(T2;...,;XN^,(Tff^ ~ '^xi,ai'^X2,(T2 ' ' ' '^XMc,crN^/^^^^' l^'-'-Uj 

We interpret (2.10) as the state in which there is an electron at site Xi with spin cXi for 
i = 1, . . . , A'^e. We allow all the possible states of the form (2.10), but make identifications 
according to the anticommutation relations (2.7). In particular, the state (2.10) is vanish- 
ing whenever (xj,(Tj) = {xj,(Tj) for some i ^ j. This is nothing but the Pauli exclusion 
principle^^. In other words, each site in the lattice can be either empty, singly occupied 
by an electron with up or down spin, or doubly occupied by electrons with opposite spins 
(Figure 2). Also note that the anticommutation relation (2.7) leads to relations like 

which is the well-known rule that a state changes its sign when one exchanges the names of 
two fermions. 

The Hilbert space TYat^ for the given electron number Ac is generated by all the basis 
states of the form (2.10). The inner product^^, which we again denote as (■, ■), is defined by 

^^If we set A'e > 2A's, then the state (2.10) always vanishes. 

^ We do not use Dirac notation since we encounter many non-hcrmitian operators. Try rewriting the 
equation (^$,5*) = (^^,A^B^^) in the Dirac notation. 
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($vac, *^vac) = 1 and the anticommutation relations. We denote the norm of a state $ G 'Hn^ 
as ||$|| = ($,$)^/^ 

We remark that there can be no physical situations which are represented by hnear 
combinations of states with different electron numbers. To see this, we take two states 
$ G "Hnc and $' G "Hn^ with iVe 7^ N'^. Since the total charge in the universe is conserved 
(as far as we know), the only (physically) possible way to linearly combine $ and $' is to 
form a state like S = $ eg) V^^ + $' (g) Fat^. Here Fat^ and Fat^ are some states of the whole 
outside world with total charge iVg and iV^, respectively. Note that the states Fjv^, and Fjv^ are 
inevitably orthogonal. Therefore if A is any operator of the electron system, its expectation 
value in S becomes 

{A)^ = (H,(A®1),S) 

= ($, A<!>) (F^,, F^J + ($', A$') (F^,, Fat,) 

= cj ($, A$) + cj' ($', A$') , (2.12) 

where 1 is the identity operator for the outside world. This means that the state S must be 
represented as a mixed state^^ if we restrict ourselves to the electron system^^. 

2.3 Coordinate free notation 

We introduce the standard coordinate-free notation for fermion operators. The systematic 
use of this notation simplifies some routine calculations. For a single-electron state cp = 
{<^x)xeA G f), we define 

Cl{^) = J2v.cl., C^{^) = J2iv.rc,,.- (2.13) 

These operators satisfy the generalized canonical anticommutation relations 

{c,{cp),Cl{^)}=6^y{cp,,l,), (2.14) 



and 



{c^{^), a'(V')} = {cii^), cii-ii^)} = 0, (2.15) 



"'^^Note that the expectation value in the (unphysical) pure state S' — q:$ + «'$' becomes {A)~, = 
|ap ($, A^) + |a'p ($', A$') + {a*a' ($, A$') + (h.c.)}, which has extra cross terms. 

""^^ Although this remark might sound entirely trivial (and it indeed is), many beginners (and even some 
experts) are confused when they encounter BCS-type states which are linear combinations of states with 
different Nc- One should always keep in mind that such linear combinations are introduced for purely 
mathematical (or theoretical) reasons (which are indeed deep and beautiful) and have nothing to do with 
realistic particle number fluctuations. When one has such a linear combination state, one always gets 
physically meaningful states with fixed electron numbers by taking suitable projections. 
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for any (^, i/? G f) and a, a' =], [. 

Again (2.15) implies (C^(c^))^ = (Co-((/j))^ = 0. We also have the following property. 

Lemma 2.1 (Linear independence and the product of creation operators) Let 

if^^^ , • • • , '^*-"-' he n arbitrary states in f) . The state 

Ct(c^«)...Ct(^("))$_, (2.16) 

which is usually called the Slater determinant state (see Appendix D), is nonvanishing if and 
only if (p''^\ . . . , cp*-"-* are linearly independent. 

Proof: Let O = Cl{ip^^^) ■ ■ ■ Cl{ip^"'^). Assume that the n states are linearly dependent. 
Then we can write cp^^^ = Yll=2'^j^^''^ with some coefficients aj G C, and hence Cl.{(p^^^) = 
'^^=2^j^l('fi^''^)- "^^^ product O vanishes because {Cl{(p^^^)y = 0. 

Next we assume that the n states are linearly independent and show that 0$vac is 
nonvanishing. By repeatedly using (2.14), (2.15) and Co-(v')$vac = 0, we get 

||0$vacf = (•I'vacO^O^.ac) 

= ($.,e, a(V'(")) ■ ■ ■ a(¥'(^))C7!(^(^)) ■ ■ ■ Ct (¥'("))$vac> 

J2 (-l)P((^«,(pWi)))...((^("),(^(pW)), (2.17) 

p:(l,...,n)^(p{l),...,p{n)) 

where p is summed over the n! permutations of (1,2, ... ,n), and (—1)^ denotes the parity 
of p. The Gramm matrix G is an n x n matrix defined by (G')ij = {^^^\ (f^^^). Then (2.17) 
implies 

||0$.acf = det(G'). (2.18) 

It is a well-known fact in linear algebra that G is regular (and hence det(G') ^ 0) if and only 
if (p^^\ . . . , (f^"'^ are linearly independent. ■ 

The following lemma gives a basic characterization of the Slater determinant states (2.16). 
Although we do not use the lemma in the present paper, we state it here since it is enlight- 
ening. We encourage interested readers to prove the lemma. 

Lemma 2.2 (Slater determinant depends on a subspace) Suppose that two sets 
{(p^^\ . . . , (f^'^^ and {i/?^ , • • • , V' } c"/ states in f) span the same n-dimensional subspace of 
\). Then there is a nonvanishing constant c G C, and we have 

CM'^) ■ ■ ■ C^!(¥'("))$vac = CCM^^) ■ ■ ■ Ct (V.W)<|._. (2.19) 
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The following lemma is elementary, but of fundamental importance. 

Lemma 2.3 (General basis of I^n^) Let ^'^^\ . . . , 93*^^=-' be N^ arbitrary states in () which 
are linearly independent (but not necessarily orthogonal with each other). Then the states 

^s,,s, = f n c'M^) I f n ^'M^^ I "^-c (2.20) 

for arbitrary subsets Si,Si of {1, 2, . . . , A^s} such that^^ \S^\ + \Si\ = N^ span the Hilbert 
space Hnc ■ 

Proof: Since the Ng vectors are linearly independent, we have 5x,y = Ylj=i '^xj'fy with some 
regular matrix {ax,j)xeA,j=i,...,Ns- Thus we have 

<. = Y.^x,Ci{v'^'^), (2.21) 

i=i 

which means that the basis state (2.10) can be written as a linear combination of the states 
(2.20). ■ 

Let A = {ax^y)x,yeA be a matrix on [). We define the corresponding bilinear form of 
fermion operators 

B{/\)= 5Z4,<xa.,,c,,,, (2.22) 

x,y£A 

which is usually called the "second quantization" of A. The easily verified commutation 
relation 

[B{A),Ci{^)]=CliAcp), (2.23) 

where [A, B] = AB — BA, sheds light on the relation between the single-particle quantum 
mechanics and the operator formalism. This will be useful later. A similar commutation 
relation for two bilinear forms 

[B{A),B{B)] = B{[A,B]) (2.24) 

is also easy to prove. 



20 



Throughout the present paper, we denote by \S\ the number of elements in the set S. 
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We have so far treated the spin index a separately from the coordinate x. It is of 
course possible to treat the combination (x, a) as the coordinates of the system and con- 
sider the corresponding 2A''s-dimensional single-electron Hilbert space^^. We can then re- 
state all of the above results in the new language with suitable modifications. Let us 
only mention the relation corresponding to (2.24), because it will be useful later. Let A 
be a 2A^s x 2A^s matrix indexed by (a;, a) with a; G A and a =t, i, and define -B(A) = 
Sx veA X]o-T=T I '^x,o-(^)(^,o-),(y,T) Cy^r- Then for two such matrices A and B, we have 

[5(A),5(B)] = fi([A,B]), (2.25) 

where [A, B] is the commutator as 2A^s x 2A^s matrices. 

2.4 Non- interacting system 

Now that the Hilbert space has been prepared, we can introduce Hamiltonians. The Hamil- 
tonian which describes the quantum mechanical hopping with the hopping matrix T = 

\J^x,y)x,y£A IS 

i^^hop = B{T) = J2 t-,y 4,.c,,.. (2.26) 

x,y£A 

To see that (2.26) describes the desired hopping, define the single-electron state $(^ = 
C^(c^)<l>vac for y ^ f)- Then from the commutation relation (2.23), we immediately find that 
the Schrodinger equation e^ip = Hi^op^cp is equivalent to the single-electron Schrodinger 
equation (2.2) or (2.3). 

When one has diagonalized the single-electron Schrodinger equation (2.2), it is easy to 
also diagonalize the many-body Hamiltonian (2.26). As in Section 2.1, we let V be the 
eigenstate satisfying (2.4) with energy Ej, for j = 1,2, . . . ,Ns. Define the new fermion 
operator by 



a:- 



t - nU-iUi:!) 



CM^^), (2.27) 



for j = 1, 2, . . . , A^s and a =], |. 

Let S] and Si be arbitrary subsets of {1, 2, . . . , As} such that 15*11 + 15*11 = A'e, and define 




^s„s,= \ W a\A \ \\a\\ $va.. (2.28) 

Lemma 2.3 shows that these states form a basis of 'Hn^- Note that the commutation relation 
(2.23) and the eigenstate equation (2.4) imply [i/hop, 4^ = [^(T), '^^(V'^^^)] = C'liTV'^^^] = 
^^We shall use such a formalism in Appendix D. 
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Figure 3: Schematic picture of the ground state of a non-interacting many-electron system. 
The lowest Nc/2 single-electron energy levels are "filled" by both up spin and down spin 
electrons. The state naturally exhibits paramagnetism known as 'Pauli paramagnetism.' 

^jCtj^cr- This and i/hop*^vac = immediately imply 

i^hop^5„5, = \y^^, + y^^j\ ^s„sy (2.29) 




We have thus diagonalized -f:rhop. 

By choosing subsets S^, Si which minimize the energy eigenvalue J2jes ^i + X]ie5 ^i' ^^ 
get ground state(s) of the present non-interacting model. In particular, if the corresponding 
single-electron energy eigenvalues are nondegenerate, i.e., Ej < £j+i, and if N^ is even, the 
ground state of -f^hop is unique and written as 

$GS = I TT «iT«ii "^vac (2.30) 

This is nothing but the state obtained by "filling up" the low energy levels with up and down 
spin electrons, as one learns in elementary quantum mechanics (Figure 3). 

In a single-electron eigenstate of the one-dimensional example in Section 2.1, the electron 
is in a plane wave state (2.5) with a definite wave number k. The same is true for any 
translation invariant model, as we see in Appendix E. The fact that the Hamiltonian ifhop 
is diagonalized in the basis (2.28) implies that the electrons behave as "waves" in this non- 
interacting (Hubbard) model. 
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2.5 Interaction 

It is believed that interactions between electrons in a solid mainly come from the electro- 
static Coulomb force. Although the Coulomb force in vacuum is long-ranged, we consider an 
extremely short-range interaction which acts only when two electrons occupy the same site 
(i.e., the same atomic orbit). A crude justification of such a short-range (Coulomb) inter- 
action comes from the observation that the Coulomb force should be most dominant when 
two electrons approach within the minimum possible distance. In a slightly more sophisti- 
cated justification, one argues that the long range Coulomb force is screened by electrons in 
different orbital states which we have decided to forget. Our point of view, however, is that 
models with artificial short-range interactions are worth studying because they are among 
the minimum models which can be studied to elicit universal properties of strongly interacting 
electron systems. 

The Hamiltonian for the short-range Coulomb repulsion is 

Hir,t = U'^n^^-fnx,i, (2.31) 

xeA 

where f/ > is the energy from the repulsive interaction. See Figure 2. 

The expression (2.31) is already diagonalized, and it is trivial to write down eigenstates 
of -f/^int- Let L|, Li be arbitrary subsets of the lattice A such that \L^\ + |LJ = A'^e, and define 



^h,L, = I TT cl, I I n 4,, I $_. (2.32) 

Then by using the commutation relation [n^^a, c^' o-'] ~ ^x,x'^a,(T'cl ai ^^ ^"^^ 

Hint'^L^L, = Um n L^\^L,M- (2-33) 

The ground state for a given electron number Ae can be constructed by choosing subsets 
L^,L| that minimize the energy eigenvalue f/|L^ l~l -^il- When Ae < Ag, one can always 
choose L| and L^ so that L| fl L^ = holds. In this case, the ground states are highly 
degenerate and have energy equal to 0. 

It is clear (from the beginning) that the interaction Hamiltonian H^^^ is most naturally 
treated if we regard electrons as (classical) "particles" which live on lattice sites. 

2.6 Hubbard model 

The Hubbard model describes a tight-binding electron model in which electrons hop around 
the lattice and interact with each other through short-range repulsive interactions. The full 
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Hamiltonian of the single-orbital Hubbard model^^ is simply 

H = //hop + ^inf (2.34) 

We have already seen that both i/hop and iJint can be easily diagonalized. We have 
observed, however, that electrons behave as "waves" in i^hop; while they behave as "particles" 
in i/int- How do they behave in a system whose Hamiltonian is a sum of these totally different 
Hamiltonians? This is indeed a fascinating problem which is deeply related to the wave- 
particle dualism in quantum physics. We might say that many of the important models in 
many-body problems, including the (^^ quantum field theory and the Kondo problem, are 
minimum models which take into account both the wave-like nature and the particle-like 
nature (through point-like nonlinear interactions) of matter. 

From a technical point of view, the wave-particle dualism implies that the Hamiltonians 
i?int are -ffhop do not commute with each other. Even when each Hamiltonian is diagonal- 
ized, it is still highly nontrivial (or impossible) to find the properties of their sum. Of course, 
mathematical difficulty does not automatically guarantee that the model is worth studying. 
A truly exciting characteristic of the Hubbard model is that, though the Hamiltonians i/hop 
and ifint do not favor any nontrivial order, their sum H = i/hop + -f^int is believed to generate 
various types of nontrivial order including antiferromagnetism, ferromagnetism, and super- 
conductivity. When we sum up the two innocent Hamiltonians -ffhop and Hi^t, competition 
between their wave-like and particle-like characters (or between linearity and nonlinearity) 
takes place, and one gets various interesting "physics" . To confirm this fascinating scenario 
is a challenging problem in theoretical and mathematical physics. 

3 Magnetism in the Hubbard model 

3.1 Spin operators and SU{2) invariance 

We introduce the total spin operators^^ Stot = (5'tot, S^^u ^Lt) of the system by 

'-'tot ~ 2 Z-^ Z-^ (^x,a\P )a,TCx,a-, (3.1) 

xGA cr,T=t,i 

for q; = 1, 2, and 3, where p*^"-* are the Pauli matrices. It is clear from the general commutation 
relation (2.25) that^'' S[J^,S[J^,S[J^ satisfy the standard commutation relation for quantum 

^^This model is often called the single-band Hubbard model. We find this terminology confusing since 
a single-orbital model can have multiple bands depending on the structure of the hopping matrix T. See 
Appendix E. 

^"^We use the index a = 1, 2, 3 to indicate the three axes, because the symbols x, y, z are already used as 
lattice sites. 

24Note that Si:^ = B(p(")) with (p("))(,,,),(^,,) - (5,,,(p(")),,,. 
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mechanical angular momentum operators. We define the raising and lowering operators by 

c± _ c{i) + „• c(2) 

'^tot ~ '-'tot ^ '"-'tof^ 

The operators Stot are the generators of the global SU{2) rotations in the spin space. We 
say an operator A is SU{2) invariant if it commutes with S^^^ for a = 1,2, and 3. Intuitively 
speaking, an SU{2) invariant operator does not change if we change the "quantization axis" 
of spins in an arbitrary manner. 

A typical SU{2) invariant operator is the number operator rix = Ux^i +^x,i- The hopping 
Hamiltonian -ffhop of (2.26) is also SU{2) invariant. These facts can be checked easily by 
using the general commutation relations (2.25). The interaction Hamiltonian i^int of (2.31) 
is also SU{2) invariant. This fact becomes evident if we note nx^iUx^i = {(^x)^ — nx}/2. 

Knowing that the Hamiltonian H = -f^hop + -f^mt is SU{2) invariant, we can make use 
of the standard representation theory of quantum mechanical angular momenta. We note 
that the three operators H, S'^J, and (Stot)^ = X]a=i 2 3('^tot )^ commute with each other. 

(3) 
We will always consider simultaneous eigenstates of these operators. We denote by S^^^l the 

eigenvalue of S'^J, and by S'tot(5'tot + 1) the eigenvalue of (Stot)^- We call S'tot the total spin 

of the state. Let 

c. _JNJ2 ifiVe<iV,; 

^max I (2iV, - iVe)/2 if iV, > iV,. ^^'^^ 

Then the allowed values of S'tot are S'tot = 0,1,..., S'max if N^ is even, and S'tot = 1/2; 3/2, . . . , S'n 
if Np is odd. 
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3.2 Ferromagnetism 

Ferromagnetism is probably the most intuitive among various magnetic phenomena exhib- 
ited by solids. Here we shall be rather restrictive and only consider the strongest form of 
ferromagnetism, namely, saturated ferromagnetism in ground states. 

Definition 3.1 ^ Hubbard model is said to exhibit ferromagnetism if any ground state of 
H has the total spin Stot = Sn 



-"max' 



(3) 

Recall that any state with S'tot = S'max has its copies in the subspaces with S'tot = 
— S'max, —S'max + 1, • • • , S'max- Therefore the ground states are at least (2S'max + l)-fold degen- 
erate when there is ferromagnetism (in the above strong sense). Let <I>gs be a ground state 
with S'tot = "S'max- Since $gs contains only up spin electrons, we must have i^mt^GS = 0. 
This means that <I>gs is the lowest energy state of -ffhop within the subspace of iVg up spin 
electrons. Recalling the general eigenstates (2.28) of -ffhop, we find that the desired ferro- 
magnetic ground state is 

$GS= (114?) "^vac, (3.3) 
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with aj I defined in (2.27), which has the ground state energy 

Eierro = /_^^j- (3-4) 

When we have e^^ < En^+i, (3.3) is the unique ground state^^ in the subspace with S'^J = 

(3) 

Smsix- The ground state $m with S^^l = M can be obtained from this ground state by the 
standard relation 

The above elementary construction of the ground states (provided that the model exhibits 
ferromagnetism) is based on the fact that any state with S'tot = •S'max does not feel the on-site 
Coulomb repulsion. This reduces the problem to that of non-interacting spinless fermions. 
We stress that this is a very special feature of the Hubbard model, which has only on-site 
interactions. 

3.3 Instability of ferromagnetism 

To see that ferromagnetism is indeed a delicate phenomenon, we discuss some results which 
show that the Hubbard model with certain conditions does not exhibit ferromagnetism. 

We first look at the non-interacting model with the Hamiltonian H = -f^hop that we stud- 
ied in Section 2.4. Assuming that the single-electron energy eigenvalues are non-degenerate, 
we found that the ground state is uniquely given by (2.30). By noting that the operator 
a]-fa] I is SU(2) invariant and S'tot $vac = 0; ^^ immediately find S'tot $gs = 0- Thus the 
ground state has the total spin S'tot = 0, and there is no ferromagnetism. This is nothing 
but the well-known Pauli paramagnetism. (See Figure 3.) 

It is also trivial that the non-hopping model with H = H-^^^t we studied in Section 2.5 
does not exhibit ferromagnetism. In this case any state (2.32) for L|,L| with minimum 
|L| nLj is a ground state. Thus the ground states are highly degenerate and exhibit a kind 
of paramagnetism. 

Therefore neither -f^hop nor iJint alone favor ferromagnetism. As we have stressed in 
Section 2.6, our hope is that, when these two Hamiltonians are added into a single Hub- 
bard Hamiltonian H = -ffhop + -f^int, their "competition" generates totally new phenomena, 
including ferromagnetism. 



^^Whcn en^ is degenerate with £„,...,£„ (where n < Ng < m), we can replace the states j = n,. . . ,Ne 
with arbitrary states from j = n . . . ,m to get other ground states. The ground states are degenerate even 

(3) 

in the subspace with S^^l = S'max- 

^^That S'tot — also follows from the uniqueness of the ground state. 
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Next we consider the situation in which the Coulomb interaction U is finite but smalL 
We find here that there cannot be (saturated) ferromagnetism. 

Theorem 3.2 (Impossibility of ferromagnetism for small U) Suppose <U< En^ — 
El. Then the ground state of the Hubbard model does not have Stot = S^ax, i-G., the model 
does not exhibit ferromagnetism. Note that the fermi energy En^ — ^i is usually independent 
of the system size for a fixed filling factor v. 

Proof: One of the lowest energy states with S'tot = 'S'max is given by (3.3) and has the energy 
(3.4). Consider a normalized trial state 

^ = «u(n 4t)*™c, (3.6) 

which is obtained from (3.3) by removing the up spin electron with the highest energy and 
then adding a down spin electron with the lowest energy. Noting the SU{2) invariance of 
a{ ^aj p one finds that \1' has S'tot = S'max — 1- We want to evaluate the energy expectation 
value (\E',if\&). For the kinetic energy, we have (\E',iJhop^) = ^i + X]i=i~ ^j- ^^ ^^^^ -^mt, 
we note that the inequality^^ Ux^^ < 1 implies Hi^t < ^XlsGA^^^.i to get (\1', iJint^l') < 
(^, [/X^xeA^^.i^) ~ ^- Therefore we have 

(^, H^) - ^fe„o <Ei-EN^ + U<0, (3.7) 

where -Eferro is defined in (3.4), and the final bound follows from the condition of the theorem. 
From the variational principle, we see that -Eferro is not the ground state energy of H. m 

Although the above theorem ensures that the ground state cannot be ferromagnetic, it 
does not provide any information about the nature of the true ground state of the model. 
To study the latter explicitly is in general a very difficult problem which (for the moment) 
is possible only in the simplest one-dimensional model. 

Finally, we discuss the situation in which the interaction may be large but the density of 
electrons is very low. It is expected that the chance of electrons to collide with each other 
in this case becomes very small. It is likely that the model is close to an ideal gas, and there 
is no ferromagnetism. 

This naive guess is easily justified for "healthy" models in dimensions three (or higher). 
The dimensionality of the lattice is taken into account by assuming that there are positive 
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See Definition C.l for the definition of inequalities for operators. 
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constants c, no, po, and d, and the single electron energy eigenvalues satisfy 

en'e,>c^-^^ , (3.8) 

for any n such that n/Ns < po- Note that the right-hand side represents the n dependence 
of energy levels in a usual d-dimensional quantum mechanical system. Then we have the 
following theorem due to Fieri, Daul, Baeriswyl, Dzierzawa, and Fazekas [26]. 

Theorem 3.3 (Impossibility of ferromagnetism at low densities) Take fl-f^hop which 
has translation invariance (as in Appendix E) and satisfies (3.8) with positive c, Hq, Pq, and 
d > 2. Then there exists a constant pi > 0, and the corresponding Hubbard model does not 
exhibit ferromagnetism for any U > if Nc/Ng < pi holds. 

Outline of proof : The naive trial state (3.6) does not work for large U. We follow [27], and 
consider the Roth state [28] 

^ = Po^, (3.9) 

where "^ is defined in (3.6), and 



Po = l[{'i--n,,^n^,l) (3.10) 



xeA 



is the orthogonal projection (called the 'Gutzwiller projection') onto the space with no doubly 
occupied sites. Because of the projection, the state (3.9) minimizes the Coulomb interaction 
as i/int^ = 0. Thus we only need to evaluate the expectation value of -ffhop- After a tedious 
but straightforward calculation whose details can be found in Appendix F, we find 



£^fcrro < ^1 - BN, + c' p, (3.ir 



where p = N^/Ns is the electron density and c' > is a constant. From the assumption (3.8), 
we find that the right-hand side becomes strictly negative for sufficiently small p provided 
that d > 2. m 

That we have a restriction on dimensionality in Theorem 3.3 is not merely technical. In 
a one-dimensional system, moving electrons must eventually collide with each other for an 
obvious geometric reason. Thus a one-dimensional model cannot be regarded as close to 
ideal no matter how low the electron density is. We do not know whether the inapplicability 
of the theorem to two-dimensional systems is physically meaningful or not. 

^^no and d are the degeneracy of the single-electron ground states and the dimension of the system, 
respectively. 
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3.4 Two more theorems for the absence of ferromagnetism 

We briefly discuss (without proofs) two strong tlieorems wliicli also rule out ferromagnetism. 
The classical Lieb-Mattis theorem [29] states (among other things) that one can never 
have ferromagnetism in the one- dimensional Hubbard model with only nearest neighbor 
hoppings^^. One-dimensional Hubbard models with next nearest neighbor hoppings may 
exhibit ferromagnetism as we explicitly see in Sections 6 and 8. 

Theorem 3.4 (Lieb-Mattis theorem) Consider a Hubbard model with even N^ on a one- 
dimensional lattice A = {1,2, .. . , A's} with open boundary conditions. We assume that the 
hopping matrix elements satisfy \tx,y\ < oo when x = y, < \tx,y\ < oo when \x — y\ = 1, 
and are vanishing otherwise. Then for any real U , the ground state of the model is unique 
and has Stot = 0. 

The next important theorem is due to Lieb [11]. 

Theorem 3.5 (Lieb's theorem, special case) Suppose that the lattice A is decomposed 
into two sublattices as A = AVJ B with \A\ = \B\, and we have t^^y = when x,y E A or 
x,y G B. We also assume that the entire lattice is connected by nonvanishing tx,y. When 
Ng = Ng, the ground state of the model is unique and has Stot = for any U > 0. 

The electron number N^ = Ns is usually referred to as "half- filling" , since 2A'"s is the 
maximum possible number for N^. The low energy properties of a Hubbard model at half- 
filling is believed to be described by the antiferromagnetic Heisenberg model. Lieb's theorem 
stated above gives a partial justification to this belief. This theorem applies to models on 
lattices with \A\ ^ \B\ as well. In this case, it implies the existence oi ferrimagnetism, as is 
explicitly stated in [12]. 

4 Nagaoka's ferromagnetism 

4.1 Weak version of Nagaoka's theorem 

We are now ready to discuss Nagaoka's ferromagnetism^'^ [10]. Briefly speaking, Nagaoka's 
theorem establishes that some Hubbard models exhibit saturated ferromagnetism when the 
number of electrons is one less than the half-filling (i.e., N,, = Ns — 1) and the Coulomb 
repulsion U is infinitely large. Given the general fact that a half-filled system never exhibits 



^^The present theorem appears in the Appendix of [29]. The main body of [29] treats interacting electron 
systems in continuous spaces. 

^°As for the presentation of the results and the proofs, we follow [30], where a generalization of Nagaoka's 
theorem was discussed. The proof briefly described in [30] is now presented in its full detail. 
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Figure 4: Schematic picture of the origin of Nagaoka's ferromagnetism. When the hole hops 
around the lattice, the spin configuration is changed. For a model with t^^^j^ > 0, the hole 
motion produces a precise linear combination of various spin configurations which leads to 
a ferromagnetic state. 

ferromagnetism, this is a rather striking result, which demonstrates that strongly interacting 
electron systems can produce very rich and sometimes surprising phenomena. 

When U = oo and A'^c = A'^g — 1, states with finite energies have no doubly occupied sites, 
and there is exactly one empty site which we call "hole." The basic mechanism of Nagaoka's 
ferromagnetism is that the hole hops around the lattice and generates a suitable linear com- 
bination of the basis states, in such a way that the resulting state exhibits ferromagnetism. 
See Figure 4. Thouless [9] also discussed the same mechanism of ferromagnetism in slightly 
more restricted situations. Also see [31]. 

We start from the following weaker result which is easy to state and prove. 

Theorem 4.1 (Weak version of Nagaoka's theorem) Consider an arbitrary Hubbard 
model with t^^y > for any x,y & A, N^ = Ns — 1, and U = oo. Then among the ground 
states there are (25*^1 + 1) states with total spin Stot = 'S'inax(= Nf,/2). 

Note that the theorem does not establish the existence of ferromagnetism since it does not 
state ferromagnetic states are the only ground states^^. Stronger statement will be proved 
in the next section as Theorem 4.5. 

Let us prove Theorem 4.1. We first consider the treatment of the f/ -^ oo limit. Let us 
decompose the Hilbert space Hn^ as 

nN. = n%l ® n'^^, (4.i) 

■^^ Note that the statement of the theorem is vahd in the trivial model with t^.y — for all x, y, where all 
the possible spin states are degenerate. We must remark that some of the published "proofs" of Nagaoka's 
theorem and its extensions only prove such weak statements. See also footnote 40. 
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f^ ' j^j— vni~ni~i-ri~i j— v-r -rl^i^ r'r t^r r^r''-'^ 



where H}^ consists of the states satisfying Hmt^ = 0. We denote by Pq (see (3.10)) the 

/(O) 
'■Nc 

Lemma 4.2 (Characterization of the U ^ oo hmit) The U ^ oo limit of the Hubbard 

/(O) 



orthogonal projection onto 7Y]y . Then we have the foUowing. 



model is equivalent to studying the Hamiltonian^^ H = PqH-^o^ on the Hilbert space 71} 



Proof: By continuity, the eigenstates oi H = i/hop + -f^int can be classified into those with 
energies diverging as U ^ oo and those with finite energies in the U ^ oo limit. We are 
only interested in the latter. Let $ be an eigenstate with energy E which has a finite limit. 
We can assume both $ and E are continuously parameterized by U. Note that we have 
Pq^ — > $ in the U ^ oo limit, because otherwise E diverges in this limit. By applying the 
projection Pq onto the Schrodinger equation E^ = (/7hop + -f^int)'^, we get EPq^ = Po-f^hop"^ 
because Po-f^int"^ = 0. In the ?7 ^ oo limit, this becomes E^ = H^ for $ G H^^]. m 

We now prepare a basis for the Hilbert space Ti-j^ ■ We use the basis states characterized 
by the position x of the hole and the spin configuration^^ cr = (o"j;)j;eA\:r £ <Sa\x, defined as 



$x,<T = C,,T TT cLr. ^vac = C,a TT cL" "^vac (4.2) 




Here the product is taken over all the sites in A with an arbitrary but fixed order. We 
gave two equivalent expressions for ^x,(t- The spin configurations (o"(^)yeA and (c^Oj/eA ^.re 
essentially the same as cr = (o'y)yeA\x, and defined by CTy = a'y = ay for all y G A\a;. As for 
the missing site x, we set o"^ =T and a'^ =|. In the definition (4.2), we are simply supplying 
an electron at the hole site x, and then annihilating the electron by c^^i or c^ |. This may 
sound meaningless, but we get fermion signs appropriate for our purpose in this way. 

We wish to examine the action oi H = PqH]^^^, on the states (4.2). In order to get a finite 
contribution, an electron must hop into the hole site x from other sites. So we examine the 
action of Ylia=-\ i '^i.o-'^z.o- on (^x,(t- Using the two equivalent expressions (4.2) for ^x,<ti we get 






/ , cl,^Cz,a^x,<T = -C2,tn^,T I TT 4,<T,', 1 "^vac - C^,l^'. 





^^We are assuming iVo < Ng. If A^o > ^s, the relevant condition is replaced by Hmt^ ~ U{Nc — ^s)*&, 
and Lemma 4.2 is still valid. 

^■^In the present case, we will eventually find that the ground states in the U -^ oo limit are eigenstates 
of -ffhop- This is of course an accidental situation for the ferromagnetic states. Examples of exact ground 
states in the U ^ oo limit which are not eigenstates of iJhop can be found in [32, 33]. 

^"^By A\x we mean the lattice obtained by removing x from A. 

26 



n ^u ] "^-^ - "^'i ( n 4,< ) ^^ 

yyeA / \«GA / 



%,^, H^^^„) = \ ^ ^-^y '^Zzll'^''' (4-4) 



\yeA / \yGA 

= -<^.,<.._, (4.3) 

where cTz^x ^ '5a\z is the new spin configuration on A\z obtained from cr by moving a^ to 
X. Note that in the second fine of (4.3), only one of the two terms survive depending on the 
value of (Xz- Therefore the matrix elements of the effective Hamiltonian H are given by 

~^x^y II / C/y 

otherwise. 

Let $GS be a ground state of H. Since it has a finite energy, it is expanded as 

*GS = $^ Yl v{x,(t)<I>,,^. (4.5) 

Since the matrix elements (4.4) of H are real, we can assume that the coefficients '^{x, cr) 
are real^^. We define 

e.= ( E (v(^^^)n ' (4-6) 

\creSA\x ) 

and a ferromagnetic state 

^T = 5Z^-'^-'(T)' (4-7) 

xeA 

where (|) denotes the spin configuration with all spins up. 
By using (4.5) and (4.4), we find 

$GS,^'^Gs) = 5^ 5^ v{y^T)if{x,rT)l^y^^,E^^^, 

x,yeA creSA\x 

= - Yl ^^^y Y ^^y^ o-y^x)v{x, cr) 

x,y£k creSA\x 

> -J2^^^y\ 5^ {v'(^''^2/-^)}M I 5^ {v'(^.'^)}' 

x,yeA \o-eSjy\x J \(r(^SA\x 

= ~ / ^ tx,y^y^x 
x,y^A 

= ($|,if<l>|), (4.^ 



^^If this is not the case, we redefine {(f{x,(T) + ip{x,(T)*}/2 as (p(x, cr). The corresponding <i>Gs is also a 
ground state. 
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where we have used the Schwartz inequahty (with the assumption tx^y > 0) to get the 
third hne. This bound shows that $| is also a ground state. This completes the proof of 
Theorem 4.1. 

4.2 Nagaoka's theorem 

We now state a stronger and the most general version of Nagaoka's theorem. Under an 
additional condition (which can be easily verified in some typical cases), we will prove that 
the ferromagnetic states are the only possible ground states. To determine the additional 
conditions, it is better to start from mathematics. 

Let us recall the Perron-Frobenius theorem, which is standard in linear algebra. The 
following is the simplest version of the theorem^^. 

Theorem 4.3 (Perron-Probenius theorem for a real symmetric matrix) Let M = 

{^TT'i,j)i,j=i,...,N be an N X N real symmetric matrix (i.e., rriij = rrij^i eM. ) with the properties 

that 

i) f^i,j ^ for any i ^ j . 

ii) All i ^ j are connected via nonvanishing matrix elements of M. More precis ely^"^ , for 

any i ^ j , we can take a sequence (ii, . . . , i^) such that ii = i, ix = j , and rriij^^ij,^^ ^ for 

all k < K. 

Then the lowest eigenvalue of M is nondegenerate and the corresponding eigenvector 
V = (f j)i=i,...,Af can he taken to satisfy f « > for all i. 

Proof: Let us present a standard elementary proof based on a variational argument. The 
essence of the argument is that a state without "nodes" has low energy. This idea is familiar 
in quantum mechanics^^. 

1) We first prove that if an eigenvector u = ('Uj)j=i_...^7v of M satisfies Ui > for any 
i, then it inevitably satisfies Ui > for any i. To do this, we assume the converse, i.e., 
Ui > for all i, and Uj = for some j. Then from the eigenvalue equation, we see that 
^^ rrij^iUi = fiUj = 0. Since rrij^iUi < 0, this means Mj = for all i with m^ j ^ 0. Because we 
have the connectivity ii), we can repeat this argument until we see Mj = for all i, which is 
a contradiction. 

2) We then prove that a normalized eigenvector v = (fi)j=i,...,Af for the lowest eigenvalue 
fiQ can be chosen to satisfy Vi > for all i. Since all rriij and /iq are real, we can assume 

^^The full version of the theorem applies to non-symmetric matrices as well. See, for example, page 130 
of [34]. 

■^^Another way of stating the condition is that, for any i ^ j, there is K such that {M^)ij ^ 0. 
^^See, for example, Section 20 of [35]. 
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that the Vi are all real. With 1) in mind, we assume the converse, i.e., Vj > and f ^ < 
hold for some j and k. Define u = ('Ui)i=i,...,Ar by Ui = \vi\. Then from i), we see 

N N 

/io = ^ ViiriijVj > ^ UirUijUj. (4.9) 

Since /iq is the lowest eigenvalue, the right-hand side must also be equal to fiQ, which means 
that u is an eigenvector of M. Then the above 1) implies Ui > 0, and hence f » 7^ for all 
i. Recalling the connectivity ii), the latter property implies we can find i and j such that 
ViTJiijVj > 0. Then we have UiiriijUj < for the same i and j, which means (4.9) is valid 
with > replaced by >. This contradicts with the assumption that /iq is the lowest eigenvalue. 
3) Finally suppose that the lowest eigenvalue of M is degenerate. Then we can find two 
mutually orthogonal eigenvectors u and v. But from 2), we must have u ■ v 7^ 0, which is a 
contradiction. ■ 

We wish to apply this theorem to the present problem of the U = 00 Hubbard model 
with a single hole. As for the matrix M, we take the matrix representation (4.4) of the 
Hamiltonian for the basis states (4.2) with a fixed S^^l = X]«eA\x '^y Because of (4.4) and 
the assumption tx,y > 0, the condition i) of Theorem 4.3 is satisfied. The condition ii), on 
the other hand, is not always valid. This motivates us to consider the following connectivity 
condition. 

Definition 4.4 (Connectivity condition) A Hubbard model with U = cxd and N^ = Ng — l 
(or more precisely the hopping matrix T ) is said to satisfy the connectivity condition if all 

the basis states ^x,cr with common S'^J = Xl^eAVi: "^y ^'^^ connected with each other through 
nonvanishing matrix elements of H . 

As we shall see in the next section, the connectivity condition can be easily verified 
in the Hubbard model with nearest neighbor hoppings on most standard lattices including 
triangular, square, simple cubic, fee, and bcc. 

Consider a model which satisfies the connectivity condition. Then we can readily apply 
the Perron- Frobenius theorem to see that the ground state in each subspace with a fixed 
Slol is unique. Then Theorem 4.1 implies that this ground state must be ferromagnetic. So 
we have proved the following, which is the full (and most generalized) version of Nagaoka's 
famous theorem. 

Theorem 4.5 (Generalized Nagaoka theorem) Consider an arbitrary Hubbard model 
with tx,y > for any x,y & A, Nc = Ng — 1, and U = 00, and further assume that the model 
satisfies the connectivity condition. Then the ground states have total spin Stot = 'S'inax(= 
iVc/2), and are non- degenerate apart from the trivial (2S'max + ^)-fold degeneracy. 
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X y 

Figure 5: Exchange bonds in the "delta-chain." The horizontal bond {x, y} is not an exchange 
bond since the site z is disconnected from the rest when x and y are removed. The bond 
{x, z} is an exchange bond since the lattice remains connected after the removal of x and y 
provided that we use periodic boundary conditions. 

As stressed earlier, the knowledge that all the ground states are ferromagnetic is of 
fundamental importance. The theorem is of course no longer valid for the trivial model with 
tx,y = for all X and y. Since the theorem asserts the nondegeneracy of the ferromagnetic 
ground state for U = oo, the continuity of energy eigenvalues in U implies that the statement 
of the theorem is valid also for sufficiently large but finite U. However, we have no meaningful 
estimates of how large U should be. 

4.3 Connectivity condition 

We still have to verify the connectivity condition for some systems to make Theorem 4.5 
meaningful. It seems, however, that to write down a simple necessary and sufficient condition 
for the connectivity condition is a nontrivial problem. We here follow Nagaoka's original 
spirit, and provide a constructive criterion (i.e., a sufficient condition) for the connectivity 
condition. 

Let us introduce some terminology. By a loop of length m, we mean an ordered set 
(xi, . . . , Xm) of sites such that tx^^xi+i 7^ for alH = 1, . . . , m — 1, and tx^,xi 7^ 0. We say 
that a pair {x, y} of lattice sites is an exchange bond if they belong to a loop of length three 
or four, and the whole lattice remains connected via nonvanishing tx,y when the sites x and 
y are removed. (See Figure 5.) 
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Then we have the following sufficient condition for the connectivity. 

Lemma 4.6 (A sufRcient condition for the connectivity) // the whole lattice is con- 
nected by exchange bonds, then the model satisfies the connectivity condition. 

This sufficient condition can be easily verified in various systems. In models defined on 
regular lattices like triangular, square, simple cubic, fee, or bcc lattices with nonvanishing 
hopping amplitudes between nearest neighbor sites, it is obvious that all the nearest neighbor 
bonds are exchange bonds. Thus they trivially satisfy the condition. A less trivial example 
is the delta chain with periodic boundary conditions in Figure 5. Although the horizontal 
bonds are not exchange bonds, the whole lattice is connected via non-horizontal bonds, 
which are exchange bonds. 

Proof of Lemma 4-6: Suppose that we are given an arbitrary configuration of Ac = Ag — 1 
electrons on A. Our goal is to show that we can get an arbitrary configuration with the same 
'S'tot by moving the single hole along nonvanishing t^^y 

Let {x, y} be an exchange bond. We show below that we can exchange the spins at sites x 
and y without changing the configuration outside {x, y}. Since the whole lattice is connected 
via exchange bonds, this means we can generate any permutation of spin configurations by 
successive exchanges on the exchange bonds. This proves the connectivity condition. 

We now prove the desired property of exchange bonds. Let {x, y} be an exchange bond, 
and assume that x and y are occupied by electrons with opposite spins. We ffist bring the 
hole (by successive hops outside {x,y}) to a site other than x or y on the loop (of length 
three or four) that contains {x,y}. Next we let the hole move along the loop until the spins 
at X and y are exchanged. In a loop of length three, this is realized after the hole goes around 
the loop once, as in Figure 6. In a loop of length four'^^, we have to move the hole along the 
loop once or twice, depending on the spin configuration, as in Fig 7. Finally, we bring the 
hole back to the original location. By following the same path as in the ffist step backwards, 
we recover exactly the same configuration, except on sites x and y. m 



4.4 Stability of Nagaoka's ferromagnetism 

It is desirable to extend Nagaoka's ferromagnetism to systems with finite U and with finite 
densities of holes. Unfortunately, the Perron-Frobenius argument which works for the one- 
hole case fails even for models with two holes. There is a considerable number of rigorous 
works (including that in Nagaoka's original work [10]) which establish that saturated ferro- 
magnetism does not take place in certain situations. See, for example, [36, 27, 37, 38, 39, 24]. 
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The exchange on a length four foop is possible because electronic spins take (only) two values. 
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Figure 6: The two spins are exchanged when the hole hops around the loop once. Such 
an "exchange process along a triangle" appears repeatedly in various examples of ferromag- 
netism, and is regarded as a universal and fundamental mechanism leading to ferromagnetism 

[21]. 




Figure 7: The spins at x and z are exchanged when the hole hops around the loop once. 
The spins at y and z are exchanged when the hole hops around the loop twice (or once in 
the opposite orientation). 
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Most of these works are essentially based on variational arguments where one constructs so- 
phisticated variational states which have lower energies than the ferromagnetic state. (We 
have seen the most elementary versions of the argument in Section 3.3.) As far as we know, 
there are no rigorous results about the stability of Nagaoka's ferromagnetism'^'^ in the Hub- 
bard model. 

There is also a considerable number of numerical and theoretical works which indicate 
stability or instability of Nagaoka's ferromagnetism in various situations [42, 43, 44, 45]. 

In spite of all these efforts, it is still not known if straightforward extensions of Nagaoka's 
ferromagnetism to finite U and finite hole density are possible. It seems that a recent 
dominant opinion is that the possibility of extension depends strongly on lattice structures, 
where the triangular lattice is regarded as one of the most promising candidates. The 
importance of lattice structures is closely related to the approach from flat-band models 
that we discuss in Section 6 

5 Ferromagnetism in the Hubbard model with long- 
range hopping 

5.1 Main statement and preliminary results 

There is at least one situation in which the extension of Nagaoka's ferromagnetism to finite 
U is possible. This is an artificial model with long range hopping that we shall now define. 
We associate with each site x a constant A^; > 0, and define the hopping matrix T = 
(tx,y)x,yeA with long range hopping amplitudes by'^^ 

tx,y ^ t-^x-^yi \'^-^) 

where t > is a constant (which does not play any essential role). Note that an electron 
can hop from any site in the lattice to any other site (Figure 8). Then we have the following 
extension of Nagaoka's theorem. 

Theorem 5.1 (Ferromagnetism in the long-range hopping model) Consider the Hub- 
bard model with the hopping (5.1) with electron number N^ = N^ — 1. For any U > 0, the 
ground states have total spin Stot = 'S'inax(= A^e/2), and are non-degenerate apart from the 
trivial {2Sma.x + I) -fold degeneracy. 

*° Interesting lower bounds for the ground state energy of the U = oo model with multiple holes are proved 
in [40, 41]. We do not, however, interpret these bounds as "proofs of stability of Nagaoka's ferromagnetism" 
since the bounds do not rule out paramagnetism (e.g., in the trivial model with tx,y = for any x,y). See 
also footnote 31. 

"^^In coordinate- free notation, this becomes T = tX (^ \, where is understood as a Kronecker product, 
and A = {Xx)xeA- As for this construction, the Dirac notation T = t|A)(A| may be more informative. 
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Figure 8: In the long-range hopping model, hopping between two arbitrary sites in the lattice 
is possible. The model serves as an intermediate step in our attempt to relate Nagaoka's 
ferromagnetism and flat-band ferromagnetism. 



At first glance, the theorem looks surprising and attractive since the precise statement 
of Nagaoka's theorem is extended to an arbitrary positive value of U. As we shall see below, 
this is indeed a consequence of a very special property of the present model. 

With the hopping (5.1), the action of T onto any c^ G [) becomes Tip = tX (A, (p), where 
A = (Aa;)a;gA G f). Thus we find {(p, Tip) = t {(p, A) (A, (p) = \ (A, (p) p > 0, which means that 
T > 0. On the other hand, the single-electron Schrodinger equation (2.2) becomes 



e(p = tX (A, cp) . 



(5.2) 



This means that any (p orthogonal to A is an eigenstate of (5.2) with e = 0, which is the 

minimum possible eigenvalue. Clearly, there are {Ns — l)-fold degenerate eigenstates with 

e = 0. The remaining eigenstate is A itself, and it has the eigenvalue e = tJ^x&Ai'^^y > ^■ 

It is crucial for the electron number Ne to be set equal to the dimension iVg — 1 of the 

above degeneracy. We see from (3.4) that the lowest energy among ferromagnetic states is 
F — v^^s-i ^ _ n 

-'-'fcrro / jj—i t-i "• 



=1 



On the other hand, it is clear from (2.29) that the lowest eigenvalue of -f/^hop is if 
Nc < 2(A''s — 1). Hence we can write H^op > for N^ = Ns — 1. (See Definition CI 
and Lemma C.2.) We also have ifmt > in general. Thus (by using Lemma C.4) we get 
H = -f^hop + -f^int > 0. This, along with the fact that there is a ferromagnetic state with 
energy zero, implies that the ground state energy of if is 0. We have thus found that there 
are ferromagnetic states among the ground states. 

Let U > 0, and let $gs be an arbitrary ground state. Then (from Lemma C.6) i/hop > 0, 
Hint > 0, and //$gs = (^hop + i^int)$GS = imply 



-ffhnn* 



hop^^GS 



0, i?i.t$ 



int'i'GS 



0. 



(5.3) 
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In other words, the ground state happens to be a simultaneous ground state^^ of -f^hop and 
iJinf This is of course a very special property of the present model with singular degeneracy 
in the single-electron eigenstates. Since [/ 7^ 0, the second relation in (5.3) implies 

^n^,^n^,i = ^- (5-4) 

xgA 

5.2 First proof 

We give a proof of Theorem 5.1 which makes explicit use of Nagaoka's theorem. Since we 
have seen that there are ferromagnetic states among the ground states for f/ > 0, we only 
have to show that they are the only ground states for U > 0. 

The ground states for any U > are fully characterized by iJ^op'^GS = and (5.4). 
Noting that (5.4) implies /7int$GS = for any f/ > 0, we see that the ground states of a 
given U > remain as ground states for any values of f/ > 0, and hence in the limit U -^ 00. 
Since the ground states for f/ — > 00 are completely characterized by Nagaoka's theorem 
(Theorem 4.5), we have proved Theorem 5.1. 

5.3 Second proof 

We describe another proof of Theorem 5.1. This proof is certainly more involved than the 
first one, but provides us with an entirely different physical picture of the problem. In 
the new picture, ferromagnetism in this model is generated by an "exchange interaction" 
among the spins of electrons which are "frozen" in certain single-electron states. We recall 
that the first proof is based on Nagaoka's theorem, which suggests a dynamical picture 
that ferromagnetism is generated by the motion of the hole. It is interesting that the two 
totally different pictures apply equally to the present model. As we shall investigate in the 
next sections, the new picture will lead us to a new class of ferromagnetism now known as 
'flat-band ferromagnetism.' 

We start by constructing a (non-orthonormal) basis for the space of degenerate single- 
electron ground states. Fix an arbitrary site xq G A, and denote by A' = A\xo the lattice 
obtained by removing Xq from A. For each y G A', we define a single-electron state cp*^^^ = 

{^x^)xeA e f) by 

r A^u iix = y; 

V^^^ = < -\y iix = xo; (5.5) 

L otherwise. 

^^Of course the existence of one simultaneous eigenstate does not imply -ffhop and Hi^t are simultane- 
ously diagonalizable. Recall that quantum mechanical angular momenta have simultaneous eigenstate with 
vanishing eigenvalue. 
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This state satisfies ^A, cp'-^^) = 0, and hence it is a single-electron ground state. Since the 
states (^'^^-' with y E A' are linearly independent, they span the entire space of the single- 
electron ground states. We define the corresponding fermion operators hj b'^ = C^(<^''^^) 
for y E A' and a =1, |, which satisfies [ifhop, &y,o-] = because of the general commutation 
rule (2.23) and the definition (2.26) of -f^hop- 

Let us characterize an arbitrary ground state <I>gs for U > in a constructive manner. 
Because ifhop'^GS = 0, the ground state <I>gs should consist only of the (single-electron) zero 
energy states. Therefore it is written as*^^ 

*Gs = Yi /(^T, h) I n ^i,T I ( n ^L "^-c, (5.6) 

S.t.|L|| + |LJ=Afe 

where the /(L|,Lj) are coefficients, and L|, Lj are summed over all the subsets of A' such 
that |L|| + |LJ = Ne = Ns — 1. Each state in the sum of (5.6) is nonvanishing because of 
Lemma 2.L 

Since Ux^iUx^i > 0, the condition (5.4) (and Lemma C.6) indeed implies a stronger state- 
ment that ^x,t^a:,i'^GS = for any x E A. By further rewriting this relation as 
(cx,j.Cx,t)^(cx,iCx,t)'^GS = (and using Lemma C.7), we find that the ground state must 
satisfy 

Cx,iCx,^^GS = 0, (5.7) 

for each x E A. 

We take x E A' and examine what (5.7) implies about (5.6). Using the anticommutation 
relation {cx,a,bly} = \xoSx,ySa,a' , we obtain 

CxaCx,T*GS = ^ x[^ e L| n LJ /(L|, L^) sgn[x, L|, Lj x 



(5.^ 



where x[true] = 1, x[false] = 0, and the factor sgn[a;, L|, LJ = ±1 comes from the exchange 
of fermion operators. Since all the terms in the sum of (5.8) are linearly independent, the 
condition (5.7) implies /(L|,Lj) = for any L-^,Lj^ such that x E L-^ (1 L\^. Using this for 
all X E A', we finally see that f{L^^ L|) = unless L| fl L| = 0. Since |L|| -|- |Lj^| = |A'|, the 

^^In a more careful proof, one uses Lemma 2.3 with Ns linearly independent states ip^y'> {y e A') and A to 
construct a basis of the entire Hilbert space TCn^ , formally writes $gs a-s a linear combination of all the basis 
states, and finally uses the condition -ffhop^^GS = to see that the terms including C^(A) must be vanishing. 
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condition L| n Lj^ = implies L| U Lj^ = A'. This means that we can reorganize the sum 
(5.6) as 



(tg5./ \j/GA' / 



^, ^ ^ $vac, (5.9) 

\y£A' 

where the sum is over all the spin configurations cr = {(yy)yeK on A', and g[(T\ is a new 
coefficient. The representation (5.9) suggests a new physical picture that the motion of the 
electrons is "frozen", and only spin degrees of freedom are left. Note, however, that this 
interpretation is somewhat arbitrary since the site Xq is chosen arbitrarily. 

Now we consider the condition (5.7) with x = Xq. By using (5.9) and the anticommutation 
relation {cx,,,a,bly} = -XyS^y for any y G A', we get 

= Yl Yl sgn[x,y]tK\y{g[(T]-g[(T.,^y])l Yl 6i^, 1 <l>,ac, (5.10) 

x,yeA' (t£S/^, \^zeA'\{x,y} J 

s.t. x>y s.t.ax=h(Ty=l 

where we have introduced arbitrary ordering in A', and sgn[a;, y] = ±1 is again the fermion 
sign^^. The spin configuration cr^^y is obtained by switching cr^ and cTy in the original 
configuration cr. 

Since all the states in the sum (5.10) are linearly independent, the condition (5.7) is 
satisfied only when 

g[(T] = g[(T^^^y] (5.11) 

holds for all cr and all x,y E A'. Note that we have deduced a kind of "exchange interaction" 
(5.11) among "localized spins" from the no-double-occupancy condition (5.7). 

The exchange relation (5.11) implies that all the g[a] with common S^^l = Xl«eA' ^?/ 
assume exactly the same values^^. This means that the lowest energy state in the space with 
a fixed S'tot is unique. Since we already know that there are ferromagnetic ground states, 
this completes the second proof of Theorem 5.1. 

6 Flat-band ferromagnetism 

6.1 Definition 

We have seen that the Hubbard model with artificial long range hopping exhibits ferromag- 
netism for arbitrary U > 0. As we shall see in the present section, it is possible to assemble 

'*'*A\{j/, z} is the set obtained by removing y and z from A. 

''^Although it is not necessary in the present proof, one can show directly that such a state has S'tot = •S'max- 
See proof of Theorem 8.1. 
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many (identical) copies of the long-range model to get Hubbard models with only short 
range hoppings which still exhibit ferromagnetism [7, 8]. Unlike Nagaoka's ferromagnetism, 
the electron density is away from half-filling, and U need only to be nonvanishing. On the 
other hand, these models have very singular band structure which is expressed by the name 
"fiat-band models." 

Let us define our model by means of a "cell construction^^." Our lattice A can be written 
as 

A = Ci U ■ ■ ■ U Cm, (6.1) 

where each Cj is called a cell. Each cell consists of a single internal site and n external sites, 
where n > 2 is a constant. The simplest cell is a triangle with n = 2. The models to be 
constructed will again involve "exchange processes along a triangle." This seems to be a 
universal and fundamental mechanism of ferromagnetism [21]. See Figure 6. 

When assembling M cells to form A in (6.1), we identify some external sites from dif- 
ferent cells^^ to regard them as a single site in A. See Figure 9. The lattice A is naturally 
decomposed as A = X U £^, where X is the set of internal sites, and S is the set of external 
sites. 

We define the hopping amplitudes for the j-th cell by 

t^l = t>^^>^\ (6.2) 

where 

{A if a: is the internal site of Cj; 
1 if X is one of the external sites of Cj] (6-3) 

otherwise. 
Here t > and A > are constants. Note that this is the same as the hopping amplitudes 
(5.1) of the long range model. An essential difference, which makes the present model less 
artificial, is that we have this type of hopping only within each cell. The total hopping is 
defined as 



M 

=1 



i^^=y.^- (6.4) 



We finally assume that the whole lattice is connected via nonvanishing t^^y We then define 
the Hubbard model on A with the hopping matrix given by (6.4). 

Theorem 6.1 (Flat-band ferromagnetism) Consider the above Hubbard model with the 
electron number Nc = \S\. For any U > 0, the ground states have total spin Stot = 'S'max(= 
Nc/2), and are non- degenerate apart from the trivial (2S'max + l)-fold degeneracy. 

*^A class of exactly solvable Hubbard models with very similar cell structures (with paramagnetic ground 
states) was discovered by Brandt and Giesekus [32] before the present model for flat-band ferromagnetism 
was introduced. See also [33, 46] and references therein for further results concerning this model. 

^^There can be external sites left unidentified. 
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Figure 9: Examples of cells and lattices. Gray dots represent internal sites. From the 
triangular cell with three sites (A), one can form (a) the delta-chain by identifying two 
external sites, or (b) a decorated square lattice by identifying four. From the cell with 
five sites (B), one can form (c) another decorated square lattice. There are many similar 
examples in higher dimensions. The flat-band Hubbard model on these lattices exhibits 
ferromagnetism for any U > when the filling factor u = Nc/{2Ns) is equal to (a) 1/4, 
(b) 1/6, or (c) 1/4. 
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Note that the existence of ferromagnetism has been estabhshed in models with finite U, 
finite-range hoppings, and electron densities away from half-filling. As one might guess, the 
straightforward (and general) methods using the Perron-Frobenius theorem (as in the proof 
of Nagaoka's theorem) do not work for this problem. The proof will make use of very special 
properties of the model. This ferromagnetism is closely related to Nagaoka's ferromagnetism 
(at least) via the ferromagnetism in the long-range hopping model, but it certainly belongs to 
a new class of ferromagnetism. This new class is now known as "flat-band ferromagnetism" 
for a reason which we shall explain in Section 6.3. 

6.2 Examples 

Clearly the cell construction leads to a wide variety of models''^. Let us discuss a class of 
examples which is constructed from the most elementary triangular cell with n = 2. 

We construct the rf- dimensional L x . . . x L hypercubic lattice with periodic boundary 
conditions by assembling together the triangular cell (A) of Figure 9 regarding it as a basic 
bond (with length 1). The resulting lattice A is the decorated hypercubic lattice with 
{d+ 1)L'^ sites. The set S of external sites can be identified with the ci-dimensional Lx . . .xL 
hypercubic lattice with L'^ sites. The internal sites in X are the decorating sites at the center 
of every bond. The lattices (a) and (b) in Figure 9 represent parts of A for d = 1 and 2, 
respectively. 

Then the hopping amplitude of (6.4) becomes 



(6.5) 



Note that there are next-nearest-neighbor hoppings between nearby external sites. 

The electron number specified in Theorem 6.1 is A'^c = L'^. In terms of the filling factor, 
this corresponds to z/ = A'c/(2A's) = {2{d + 1)}"^ 

6.3 Single-electron problem 

The present models have peculiar degeneracy in their single-electron ground states which 
are quite similar to that in the long range hopping model of the previous section. 
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if |x- 1/1 = 1/2; 
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2dt 
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a X = y e I; 
ii X = y e £; 
otherwise. 







■^^It is not necessary to have the same n or A for all the cells. This further extends the possibility of 
models. 
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Figure 10: The components of the single-electron ground state ip'^^^ in the model b) of 
Figure 9. The state is localized around the site y, which is at the center of the figure. 



We first note that the hopping matrix T defined by (6.4) satisfies T > 0. This is because 
each (tx,j/)x,j/eA determined by (6.2) is positive semidefinite (which fact is clear from the dis- 
cussion in Section 5.1), and the sum of positive semidefinite matrices is positive semidefinite 
(as in Lemma C.4). 

We define for each y E S the single-electron state (p*^^) = {i^x )xeh by 



V 



{y) 



A 



.0 



if X is the internal site of one of the cells containing y\ 
otherwise. 



(6.6) 



See Figure 10. Observe that we have ( A'--' , c^'-^^ ) ~ ^^^ ^^y J = '^i ■ ■ ■ iM and y E £, where 
a'--'-' = (Xx )xga e t), and hence 

M 



(6.7) 



Since (p^^^ with y G £ are linearly independent, the single-electron Schrodinger equation 
(2.2) has |£^|-fold degenerate ground states^^ with e = 0. It is needless to say that such a 
huge degeneracy is accidental, and makes the present models very special. We also stress 
that such a degeneracy is lifted by an arbitrarily small generic perturbation applied to the 
hopping matrix. 

In models with translational invariance, the degeneracy in the single-electron ground 
states corresponds to the lowest band being completely dispersionless, or "flat." For example. 
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It is not hard to prove that A^-*^ with j ~ 1, . . . , M span the remaining space with e > 0. 
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in the models on the d- dimensional hypercubic lattice of Section 6.2, we find that there are 
d + 1 bands with dispersion relations 

fO ifj = l; 

e,{k) = \\H ifj = 2,...,d; (6.8) 

[(A2 + 2rf)t + t^;;^,cosA;^ iij = d+l. 

Note that the lowest band and the {d— 1) middle bands are completely fiat, while the upper 
band is dispersive. For definitions of bands and dispersion relations as well as an explicit 
calculation for the d = 1 case, see Appendix E. 

6.4 Proof 

We now prove Theorem 6.1. Interestingly, the proof is completely analogous to and not 
more difficult than the second proof of the (less attractive) Theorem 5.1 for the long range 
hopping model. 

Since the single-electron ground states of the model are |£^|-fold degenerate, and the 
electron number is set to A^c = |^|, all the preliminary considerations in Sections 5.1 and 5.3 
apply to the present model as well. Any ground state $gs satisfies the conditions (5.3), and 
(5.7) for any a; G A. 

Again introducing 6j^^^ = C].((^*^^^) for y E £, and using i/hop'^GS = 0, we can represent 
any ground state as 

$Gs = Yi /(^T, ^i) ( n ^Lt I ( n K,i I *-, (6.9) 

S.t.\Ll;\ + \Ll\=No 

where /(-^t, -^j,) are coefficients, and L|, L^ are summed over all the subsets of S such that 
|L|| + |L|| = A^e = 1^1- 

Exactly as before, the condition (5.7) ioi x E S shows that there can be no "double 
occupancies" in b^ , i.e., /(I>|, L^) = whenever L| U L^ 7^ 0. We again reorganize the sum 
to get a spin system representation 

a-eSs \yeS J 

where the sum is over all the spin configuration cr = {o'y)y(z£ on S, and g[(T] is a new 
coefficient. 

Next we examine the implication of the condition (5.7) for a; G X on the ground state 
(6.10). Unlike in the long range hopping model, there are still many relations to use. Let S^ 
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be the set of external sites contained in the cell which contains an internal site x. By using 
(6.10) and the anticommutation relation {cx,cr^by a'} — ~^<T,a'X[y ^ ^x] for any x eT and 
y E S, we get 

C^,iC^,T*GS= Yl Yl Sgn[x,|/](^[o-]-^[o-j,^,]) I Yl ^U^Nvac- (6.11) 

j/.zG^a: o-eSe \ue£\{y,z} j 

s.t. y>z s.t.(Ty=t,o-2=J. 

Since this quantity vanishes for all a; G X, we finally find that 

g\cT\=g\cjy^,\ (6.12) 

for any y,z E 8 which belong to a common cell. Note that the "exchange interaction" is 
short range because we are treating models with short range hoppings. Since the entire 
lattice is connected, (6.12) ensures that the lowest energy state is unique in each sector with 
a fixed S'tot. This completes the proof of Theorem 6.1. 

6.5 Mielke's flat-band ferromagnetism 

A slightly different class of Hubbard models which have highly degenerate single-electron 
ground states and exhibit ferromagnetism was discovered by Mielke [14, 15] before the above 
models were introduced. Here we briefly summarize Mielke's beautiful construction based 
on graph theoretic notion. For a general theory of flat-band ferromagnetism obtained by 
Mielke, see [47]. 

We start from abstract notation. Let G = {V, E) be an abstract graph, where V is the 
set of vertices (sites) a,P, . . . G V, and E is the set of edges (bonds) which are nothing 
but pairs of vertices like {a,f3}. Given a graph G, one can construct the corresponding line 
graph L{G) = (Vl, -^l) by the following procedure. The set of vertices (sites) Vl (whose 
elements are denoted as x, y, . . . G Vl) is taken to be identical to the set E. This means 
that we identify edges in G with the vertices (sites) in L{G) as, for example, x = {a,P}, 
y = {a, 7}, etc. Next we declare that two vertices x,y E Vl are adjacent to each other if 
the corresponding two edges in E share a common vertex in V. The vertices x and y in the 
above example are adjacent to each other since the corresponding edges in E have a common 
vertex a. Ei^ is the set of edges (bonds) in L{G), which consists of all the adjacent pairs 
(like {x, y}) of vertices (sites) in Vl. Finally we set M{G) = |-E| — |V^| + 1 if G is bipartite^", 
and M{G) = |-E| — |V^| if G is non-bipartite. 

^^G is bipartite if it can be decomposed into two disjoint sublattices as G = A U i? with the property that 
any edge in E joins a vertex in A with a vertex in B. 
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Figure 11: The kagome lattice is the hne graph of the hexagonal lattice. Mielke showed that 
the Hubbard model on the kagome lattice exhibits ferromagnetism when the filling factor is 
V = 1/6 for any U > 0. This is the most beautiful example of fiat-band ferromagnetism. 

We define the Hubbard model on the line graph L{G). With each site x G Vl, we 
associate the fermion operator Cx,a, and consider the Hamiltonian 

where t > is a constant. Then the main result of [14] is the following. 

Theorem 6.2 (Mielke's fiat-band ferromagnetism) Suppose that the graph G is twofold 
connecte(f^ . Consider the above Hubbard model with Ne = M{G). Then for any U > 0, the 
ground states of the model have total spin Stot = 'S'max(= Nc/2), and are nondegenerate apart 
from the trivial (2S'max + l)-fold degeneracy. 

This theorem applies to the Hubbard model defined on various line graphs, a typical one 
being the kagome lattice of Figure 11. 

6.6 Beyond flat-band ferromagnetism 

Let us once again take a look at the mechanism by which ferromagnetism is generated in 
a fiat-band model. Refiecting the bulk degeneracy in the single-electron ground states, the 
ground states of the non-interacting model with H = i^hop are highly degenerate. There are 

^^A graph is twofold connected if and only if one cannot make it disconnected by the removal of a single 
vertex. 
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ground states for any possible values of total spin S'tot, including the smallest Stot = or 1/2 
and the maximum S'tot = 'S'max(= ^c/2). When we introduce the Hubbard interaction -ffmt, 
the energy of the ferromagnetic ground states (of -f/^hop) do not change since ferromagnetic 
states do not feel on-site repulsion. These observations are indeed trivial. A truly nontrivial 
point (which is proved by imposing some conditions on models) is that all the other ground 
states of -f^hop receive extra energy from H^^t, and the ferromagnetic ground states become 
the only ground states oi H = -f^hop + -f^int- 

In this sense, the flat-band ferromagnetism certainly takes into account nontrivial inter- 
play of -ffhop and iJint- However, there is no true "competition" between i/hop and i/mt- The 
ferromagnetic ground states are already present in the U = model among highly degener- 
ate ground states. The only role of the interaction is to lift the degeneracy, and "select" the 
ferromagnetic ground states as the only ground states. This is why flat-band ferromagnetism 
takes place for any U > 0. 

As we have seen in Theorem 3.2, we never realize saturated ferromagnetism for values 
of U which are too small in a system without bulk degeneracy in the single-electron ground 
states. Of course, the bulk degeneracy in a flat-band model is far from being robust, and it is 
easily destroyed by adding to the hopping matrix T an arbitrarily small generic perturbation. 
Then, an essential question is whether ferromagnetism found in a fiat-band model remains 
stable after adding a small perturbation to T which makes the lowest band "nearly fiat." 
If the ferromagnetism were unstable against perturbations, we would have to conclude that 
fiat-band ferromagnetism is a mere mathematical game. 

It was conjectured [7, 8] that the fiat-band ferromagnetism is stable against small per- 
turbations to the hopping Hamiltonian. Among the main arguments used in that discussion 
was that an approximate low-energy effective Hamiltonian of the fiat-band model has the 
precise form of the ferromagnetic Heisenberg spin system. Kusakabe and Aoki [48, 49] made 
the first systematic study about stability of fiat-band ferromagnetism. They argued that 
the fiat-band models possess spin-wave excitations which have healthy dispersions, and this 
fact guarantees the robustness of the fiat-band ferromagnetism. They also found numerical 
evidence that ferromagnetism remains stable if we add sufficiently small perturbations to 
the hopping matrix, thus making the lowest fiat band "nearly flat." 

As for rigorous results, stability of ferromagnetism under a single-spin flip is proved 
in [16, 17] for the model obtained by adding an arbitrary small translation invariant per- 
turbation to the hopping matrix of the translation invariant flat-band Hubbard model of 
Section 6.1. Although this only establishes the local stability of flat-band ferromagnetism, 
it is a remarkable nonperturbative result which applies to a robust class of models. Given 
the global stability of ferromagnetism in the original flat-band model, the local stability of 
ferromagnetism in a nearly-flat-band model provides very strong evidence that the nearly- 
flat-band model indeed exhibits globally stable ferromagnetism. 

Finally, the problem of stability of ferromagnetism was completely solved in [18, 19] for 
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a special class of models. Let us explain the results briefly. 

Take the same lattice A as in Section 6.2, i.e., the d-dimensional decorated hypercubic 
lattice. We define special perturbations to the hopping amplitudes as follows. For each 
z E £, we let 

41 = -s^^^:^^^t\ (6.14) 

where 

{A ii X = z] 
1 if X is an internal site adjacent to z; (6.15) 

otherwise. 
Here s > is a new constant, and A > is the same as in Section 6.1. 

Then the hopping matrix T = (tx,y)x,yeA of the perturbed models is defined by^^ 

M 

t^,y = E^S + E41 + (A's - 2dt)6x,y, (6.16) 

j=i zee 

where tj^y is the same as in (6.2). This t^^y can be written more explicitly as 

(\{t + s) if Ix - yl = 1/2; 

t ii x,y E S and \x ~ y\ = 1; 

if x, y G X and Ix — yl = 1 or 1/v^; (6-17) 



^x,y 



— S 

{\^-2d)t+{\^-2)s iix = yeT, 
otherwise. 

As can be seen also from Figure 12, this model contains nearest-neighbor hoppings and some 
next-nearest-neighbor hoppings (but not more than that). The amplitudes for the different 
hoppings and the on-site potential must satisfy special relations because there are only three 
controllable parameters, t > 0, s > and A > 0. 

The single-electron energy bands of the model can be easily obtained. There are {d + 1) 
bands with dispersion relations 

(-2d{t + s)-2sZt=i^o^K ifj = l; 
£jik) = } X^(t + s)-2dt if J = 2, ■ ■ ■ , d; (6.18) 

[\\t + s) + 2tJ2'l^,cosk^ iij = d+l. 

If c? > 1, the middle bands with j = 2,- ■ ■ ,d have constant values (i.e., are flat and degener- 
ate), reflecting the geometry of the decorated hypercubic lattice^^. For us it is crucial that 
the most important lowest band has a healthy non-constant dispersion. 

^^The term with the Kronecker delta is included only to make the expression (6.17) simpler and is not 
essential. 

^•^It is possible to design short range perturbations to T which make these bands non-flat and non- 
degenerate, while allowing Theorem 6.3 to hold. 
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Figure 12: The li-dimensional decorated hypercubic lattice with the nearest neighbor and 
the next nearest neighbor hoppings for c? = 1 and 2. These models are obtained by adding 
special perturbations to the models (a) and (b) in Figure 9, respectively. In a range of 
parameters, we can prove that the corresponding Hubbard model with sufficiently large but 
finite U exhibits ferromagnetism even when the lowest band is not flat. 
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We consider the Hubbard model on A with the above tx,y and electron number iVe = |£^| = 
L*^. If s = 0, the model exhibits the flat-band ferromagnetism of Theorem 6.1 for any U > 0. 
When s > and the lowest band is no longer fiat, the model exhibits Pauli paramagnetism 
for U = 0. Moreover, from the elementary variational estimate of Theorem 3.2, we find that 
the model does not exhibit ferromagnetism when U < Ads. We can say that there is a true 
"competition" between -f^hop and H^^i in this model. If the model exhibits ferromagnetism, 
it must be in a non-perturbative region with sufficiently large U. The following theorem 
establishes such a non-perturbative statement in the case that the lowest band is sufficiently 
fiat. 

Theorem 6.3 (Ferromagnetism in non-singular Hubbard models) Consider the above 
Hubbard model with electron number N^ = \S\ = L'^, and let^^ A > 0. If s/t is sufficiently 
small, and U/t is sufficiently large, the ground states have total spin S^ot = 5'max(= Nc/2), 
and are non- degenerate apart from the trivial (2S'max + l)-fold degeneracy. 

For given d > 1 and A > 0, the range of s/t and U/t in which the theorem holds can be 
determined by finite calculations which can be (in principle) executed by a computer. This 
means that one can construct a computer-aided proof if one desires. We have only performed 
very elementary calculations for d = 1 with a personal computer. When we set A = \/2 (in 
which case t^^x = for all x), the theorem is valid, for example^^, if s/t < 0.4 and U/t > 40. 

Although the model is rather artificial, this is the first rigorous example of saturated 
ferromagnetism in a non-singular Hubbard model in which we must deal with the competition 
between H[^t and -f^hop- If "we further have s/t <^ 1, U/t ^ 1 and A 3> 1, it has also been 
proved that low-lying excitation above the ground state has a normal dispersion relation of 
spin-wave excitations [18, 17]. Starting from a Hubbard model of itinerant electrons, the 
existence of a "healthy" ferromagnetism has been established rigorously. 

Recently, Penc, Shiba, Mila, and Tsukagoshi [50] made a systematic study of related 
one-dimensional Hubbard models and found various pieces of evidence suggesting that the 
ferromagnetism in these models are indeed robust. 

"'''^In [18], we presented a complete prool for d — 1 which, however, assumed A > Ac > 0. The improved 
proof in [19] applies to general d and only requires A > 0. 

^^For sufficiently large U, the theorem is valid for t = 1.6s, in which case the lower band occupies more 
than 1/4 of the whole range of the single-electron energy spectrum (including the gap). In this case the 
lower band may not even be "nearly-flat" . 
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• Cu OLa QO E^^^di ^ 

Figure 13: A chain of Cu, O, and La formed along the c-axis of La422. One finds a triangular 
structure formed by La and O, which reminds us of the (nearly-)flat-band models of Section 6. 

7 Possible experimental realizations of (nearly-) flat- 
band ferromagnetism 



7.1 Ferromagnetism in La4Ba2Cu20 
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In 1990 (when the only example of saturated ferromagnetism in the Hubbard model was 
that of Nagaoka), Mizuno, Masuda, Hirabayashi, Tanaka, Hasegawa and Mizutani [51, 52] 
reported that the tetragonal cuprate La4Ba2Cu20io (also called La2BaCu05), which we 
shall abbreviate as La422, exhibits ferromagnetism^^. A series of experiments [51, 52, 53] 
has revealed that La422 is an ideal insulating ferromagnet with Curie temperature 5.2 K, 
where most of the magnetic moment comes from the spin 1/2 moments of the Cu^"*" ions. 

The origin of the ferromagnetic coupling between Cu-ions within the ab-plane of La422 
is expected to be described by the GKA rules [52, 54], which involve exchange interactions 
between excited orbital states and ground orbital states. Ferromagnetic coupling along the c- 
axis seems to require more careful treatment. Along the c-axis, La422 forms a characteristic 
chain structure consisting of Cu, O, and La. This is shown in Figure 13. Interestingly 
enough, we see La and O forming a triangular structure similar to those in the (nearly- 
)flat-band models of Section 6. We stress that there are experimental results which strongly 
indicate that the existence of La is essential for the occurrence of ferromagnetism. One such 
result is the observation that the cuprate Nd2BaCu05, which has the same crystal structure 
as La422, exhibits antiferromagnetic ordering [55, 56]. It is pointed out that the essential 
difference is that Nd has a magnetic moment, while La does not. This point is supported by 
the result [57] that ferromagnetism remains stable upon partial replacement of La in La422 
by Eu, which has no magnetic moment. Another such result is from NMR measurement in 
La422 that there is a large hyperfine field at La sites [58]. This suggests that the magnetic 

^^Ferromagnetism was discovered while investigating the superconductor LaBa2Cu307-j;. 
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moments are not sharply localized at Cu-sites, but partially exist on La-sites as well (exactly 
as in the (nearly-)flat-band models). 

On the other hand, a band calculation based on the Local Density Approximation (LDA) 
[59] revealed that two bands of La422 near the fermi level are almost flat^^. 

These observations about the lattice structure and the band structure may be regarded 
as clues that the ferromagnetism in La422 is related to the ferromagnetism in the Hubbard 
model with (nearly-)flat-band. As far as the present author knows, this possible relation was 
first realized by Hirabayashi [60]. 

In fact it is not hard to construct a flat-band Hubbard model on the lattice of Figure 13 
where hopping amplitudes are nonvanishing only on the bonds in the figure. We can prove 
that the model exhibits ferromagnetism when the number of electrons is identical to the 
number of Cu-sites in the lattice. 

This theoretical result, however, seems to be almost irrelevant to the ferromagnetism 
in La422. Analysis of the orbital structures of La422 [54] suggests that a simplified^^ but 
reasonable tight-binding description of the system is given by the lattice structure of Fig- 
ure 13 with one non-degenerate orbit per each site, and with the electron number equal to 
the number of Cu-sites plus the twice the number of 0-sites. (In other words, originally Cu 
orbits are singly occupied, O orbits are doubly occupied, and La orbits are empty.) Then the 
resulting ferromagnetism is expected to be a "partial ferromagnetism" in which only spins 
from Cu-sites align ferromagnetically. Unfortunately, we still do not know how to treat such 
partial ferromagnetism theoretically. 

Nevertheless we believe that there is a deep relation between the ferromagnetism in La422 
and that in Hubbard models with (nearly-)flat-bands. It is a challenging problem to further 
investigate this relation both from the above described tight-binding model^^ and from a 
continuum model. 

If one wishes to see a more direct connection between the ferromagnetism in La422 and 
the (nearly-)flat-band ferromagnetism, one may try reducing the above Hubbard model to 
a simpler one by relying on some approximate argument. For example, one can consider 
the limit in which the on-site potential eo for 0-sites (which is t^^x, with x being an 0-site) 
is negative, and its absolute value is much larger than other It^^yl- In this limit, all the O- 
sites are doubly occupied, and electrons (including those on Cu- and La-sites) cannot move 
on the lattice. If we consider a perturbation expansion in 1/eo up to second order, there 
arise effective hoppings between Cu-sites and La-sites, and between nearby Cu-sites. The 



■''^Iii LDA calculations, the Coulomb interaction between electrons is supposed to be taken into account 
in a certain self-consistent manner. Therefore the flatness of two bands in the LDA calculation does not 
directly indicate that the corresponding tight-binding model has a flat band. 

^^This model is a slight simplification of the tight-binding model discussed in [54]. 

^^ Apart from numerical experiments (which are always possible), systematic perturbative calculations of 
"effective Hamiltonian" along the line of [50, 54] are also possible and may be enlightening. 
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Figure 14: In the second order perturbation in 1/eo, the model of Figure 13 reduces to a 
Hubbard model on the Cu-La chain in the figure with the electron number equal to the 
number of the Cu-sites. The new model falls into the class of (nearly-)fiat-band ferromag- 
netism for suitable hopping amplitudes. This observation provides a crude link between the 
ferromagnetism in La422 and the (nearly-)fiat-band ferromagnetism. 

model then reduces to an effective model with Cu-sites and La-sites, as in Figure 14, with 
the electron number equal to the number of the Cu-sites. For suitable values of hopping 
amplitudes, the model falls into the class of (nearly-) fiat-band ferromagnetism of Section 6. 
This observation provides a crude link between the ferromagnetism in La422 and the (nearly- 
) fiat-band ferromagnetism. 

7.2 Possible ferromagnetism in quantum wires 

Recent developments in nanotechnology are making it possible to design various atomic scale 
structures which should show nontrivial quantum behavior. In [61], Watanabe, Ichimura, 
Onogi, Ono, Hashizume, and Wada made a theoretical study of the electronic properties 
of Ga adsorbates around dangling-bond wires on an H-terminated Si surface, and pointed 
out the possibility of ferromagnetism. The effective tight-binding model (which is not quite 
faithful to the actual lattice structure) used in [61] has the lattice and hopping structure 
shown in Figure 15 and has electron number^° A^e = Ns/3. 

™To be precise, 5/6-filling with Nc ~ 5Ns/3 was considered in [61]. This electron number is changed 
into Nc = Ns/3 by using the hole-particle transformation. (See Appendix B.) One should note that the 



51 



^ ^\ ^\ 



Figure 15: The effective tight-binding model for the "quantum wire." It is possible to design 
Hubbard model with the same structure which shows flat-band ferromagnetism [62]. 

It is possible to construct Hubbard models with exactly the same hopping structure so 
that the models have flat lowest bands [62]. Let the lattice be A = {1, 2, ... , Ng}, where Ng 
is a multiple of three, and impose periodic boundary conditions. The Hamiltonian is then 

Ns/3 
H = t 2_^ 2_^ (-^Csj-l.o- + C3j-2,o- + C3j,cr) \^Czj-l,a + C3j-2,o- + Csj.o-) 

Afs/3 Ns/Z 

+ f/ X(n3j_2,Tn3i-2,i + ^3j,T«3ia) + U'Y^ n3j_i,|n3^„i,|, (7.1) 

i=i i=i 

with t, s, f/, f/' > and A 7^ 0. This model has a flat lowest band with energy 0. For 
electron number A^c = As/3, one can prove that the model exhibits ferromagnetism [62]. 
Since the discussion about the robustness of the flat-band ferromagnetism applies to the 
present models as well, the present ferromagnetism is also expected to be stable against 
perturbations. 

8 Towards metallic ferromagnetism 

8.1 Conjectures and some evidence 

Metallic ferromagnetism is a fascinating phenomenon in which electrons exhibiting ferro- 
magnetism contribute to electric conductivity as well. Whether such simple models as the 
single-orbital Hubbard model can describe metallic ferromagnetism is an unsolved and in- 
triguing problem. 



hole-particle transformation does not only change the signs of the hopping amplitudes, but it also introduces 
a difference in on-site potentials when [/ 7^ [/' in the Hamiltonian (7.1). 
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Nagaoka's ferromagnetism is certainly motivated by metallic ferromagnetism, and we 
believe it reveals some important aspects of a possible mechanism of metallic ferromagnetism. 
But for the models where the theorem is proved, the only dynamical freedom comes form the 
motion of the single "hole." We cannot expect the single hole to contribute to appreciable 
electric current in a bulk system. 

In the flat-band models of Section 6.1 and the related nearly-flat band models of Sec- 
tion 6.6, the existence of ferromagnetism is proved for special electron numbers. These 
electron numbers correspond to the half-filling of the lowest bands, but since the ground 
states are ferromagnetic, the lowest bands become effectively fully filled. Then the systems 
should behave as (Mott) insulators. 

In the flat-band models of Section 6.2 defined in dimensions greater than one, the exis- 
tence of ferromagnetism is proved [7, 8] for (not too small) electron numbers less than \£\. 
There is a similar rigorous result for Mielke's model in two dimensions[15]. We expect the 
models to describe a kind of metallic ferromagnetism, but the situation is not clear because 
of pathological degeneracy in the (many-body) ground states. In any case, the flat-band 
models, which are quite useful in describing the origin of exchange interaction in certain 
systems, are too singular to discuss electric conductivity. 

Promising candidates of simple models exhibiting metallic ferromagnetism are the nearly- 
flat-band models (obtained by adding perturbations to the flat-band models) at filling factor 
V = Ne/{2Ns) different from uq = |£^|/(2A''s). For the case z/ < z/q, the approximate projection 
method (similar to those described in [8, 17], but based on a local Wannier basis) indicates 
that the low energy effective theory of these models is represented by the ferromagnetic t-J 
models in any dimension [19]. (The definition of the ferromagnetic t-J model can be found in 
Section 8.2.) This observation leads us to the conjecture that these models exhibit metallic 
ferromagnetism at least when z/ < z/g is not too small and the model is sufficiently close to 
a flat-band model. 

Penc, Shiba, Mila, and Tsukagoshi [50] made a detailed study of related problems in 
one dimension, and found both theoretical and numerical evidences that the model exhibits 
metallic ferromagnetism for arbitrary filling factor within the range < z/ < z/q = 1/4. That 
the ferromagnetism is stable for an arbitrarily small density refiects the special character of 
one-dimensional systems as noted in Section 3.3. Sakamoto and Kubo [63] also found strong 
numerical evidence that related one-dimensional models exhibit metallic ferromagnetism for 
< z/ < z/q. 

Watanabe and Miyashita [64, 65] treated the one-dimensional flat-band Hubbard model 
of Section 6.2 for a filling factor z/ in the range 1/2 > z/ > z/q = 1/4. Since the lower 
band is totally filled for such electron densities, the fiatness of the lower band is regarded as 
irrelevant. Note that the case with iVe = iVg — 1 is nothing but Nagaoka's ferromagnetism; 
we have come back to Nagaoka's ferromagnetism following a different path! Watanabe and 
Miyashita found numerical evidence that the model exhibits metallic ferromagnetism for all 
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densities they considered. 

A different (but in some sense related) candidate for a model with metallic ferromag- 
netism is the so-called t-t' model. It is defined on the one-dimensional lattice A = {1, . . . , As}, 
and its hopping amplitudes are defined by tx,x+i = tx+i,x = —t, tx,x+2 = tx+2,x = t' for all 
X E A and tx^y = otherwise. From a first order perturbation theory, it was suggested [66, 67] 
that the model with U = oo exhibits ferromagnetism if t' > 0. By considering a continuum 
limit theory, Miiller-Hartmann [68] argued that the model exhibits metallic ferromagnetism 
when 4t' > |t| > 0, f/ = cxD, and the electron density is sufficiently low. There is numerical 
evidence that the t-t' model exhibits ferromagnetism [26, 69]. 

Kohno [70] discussed the possibility of metallic ferromagnetism in the Hubbard model 
on a ladder. Both perturbation theory and numerical calculations suggest that metallic 
ferromagnetism appears for electron numbers satisfying As/2 < Ac < As- It is interesting 
that, in this model, itinerant electrons in upper bands play essential roles in generating 
ferromagnetism. 

Unfortunately, none of the above conjectures have yet been confirmed rigorously^^ . 

8.2 Ferromagnetic t-J model 

In the present and following sections, we discuss some rigorous results concerning (metallic) 
ferromagnetism in one-dimensional models with U ^ oo related to the Hubbard model. 
Thanks to the special nature of one-dimensional models, all the results can be obtained by 
straightforward applications of the Perron- Frobenius theorem. As far as we know, such an 
application of the Perron-Frobenius theorem to metallic ferromagnetism in one dimension 
was developed in the pioneering work of Kubo [72], where he studied the double exchange 
model. Although the results we shall discuss appear to be somewhat similar to the conjec- 
tures in the previous section, we still have no idea if they shed light on the truly fascinating 
problem of metallic ferromagnetism in the Hubbard model. 

We first discuss the ferromagnetic t-J model. This is not an interesting model (in one 
dimension), but helps us in illustrating the basic structure of the proof that we use for 
other models. Unlike the Hubbard model, this model contains spin-spin interactions which 
explicitly favor ferromagnetism. The problem is whether ferromagnetism is realized in the 

^"'^In a fermion system on a finite lattice, a formal perturbation series alway converges because the operators 
are finite dimensional. Then one might think that the first order perturbation theories of [66, 67, 70] 
imply weak rigorous results that a finite model exhibits ferromagnetism for sufficiently small t' /t (or iii/t^). 
However, this is not the case since there are no estimates of the energy difference between the ground state 
and the first excited state, and there remains a possibility that the higher order perturbations change the 
nature of the ground state for any small values of the expansion parameter. The situation is different from 
that of Nagaoka's ferromagnetism, where the result for t/ = oo automatically implies the same result for 
sufficiently large U (in a finite system). For an example of rigorous (and general) perturbation theory for 
quantum many-body systems, see [71]. 
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presence of hopping. As we shall see (and is well-known), the problem is trivial in one 
dimension. 

Consider the one-dimensional lattice A = {l,...,A's}. The Hamiltonian of the ferromag- 
netic t-J model is 

Ns Ns Ns 

H = -tJ2Yl (4,.c.+i,. + h.c.) - j5^(S. ■ S.+i - ^^i^) + Uj2n.,^n,,i, (8.1) 

x=l o-=t,i x=l x=l 

with t > 0, J > 0. Here h.c. represents the hermitian conjugate. We let f/ ^ oo to inhibit 
double occupancies. We use periodic boundary conditions and identify the site A^g + 1 with 
1. The spin operator at site x is defined as S^; = '^^ t '^l aiP^°'^)o;T Cx,a- (See (3.1).) 

Theorem 8.1 (Ferromagnetism in the one-dimensional ferromagnetic t-J model) 

Assume that the electron number N^ satisfies N^. < Ns and is odcf^. Then the ground states 
of the present t-J model have total spin Stot = Sma.x{= Ne/2), and are non- degenerate apart 
from the trivial (2S'max + ^)-fold degeneracy. 



Proof: The proof is based on an elementary observation which one may call a rigorous "spin- 
charge separation" argument. (See [50] and references therein.) To take into account the 
U ^ oo limit (with the help of Lemma 4.2), we use as our basis the collection of the 
states (2.10) with xi < X2 < ■ ■ ■ < x^^ and aj =t, i such that S^J^ = ^jIiCTj = 1/2. 
The theory of angular momenta ensures that any eigenstate of H has its copy in the sector 

(3) 

with the lowest 1 5*^01 1- This sector is spanned by this basis. We first claim that all the 
matrix elements of H in the present basis are nonpositive. As for the hopping term, this 
is trivial since there is no exchange in electron ordering and one does not have to worry 
about fermion signs^^. As for the exchange term, this is easily verified by using the identity 
S^ ■ Sa;+i = {S^S^^i + S^S^^i)/2 + Sx 'S'^+i- It is also verified that all the basis states are 
connected via nonvanishing matrix elements of H. Note that the existence of the exchange 
term is essential for this property. Therefore we can readily apply the Perron-Frobenius 
theorem (Theorem 4.3) to conclude that the ground state $gs is unique (in this sector), and 
it is a linear combination of all the basis states with positive coefficients. 

This fact indeed implies that for <I>gs; 'S'tot = Sma.x- To see this, we first note that 
uniqueness of the ground state implies that $gs is an eigenstate of (Stot)^- We then take 

^^We can allow any A'e < A's if we use open boundary conditions. Peculiar dependence of the nature of 
the ground states on the parity of the electron number and boundary conditions is related to the appearance 
of spiral states. See [72, 73, 50, 65]. 

^"^Hops between sites 1 and A^o are exceptional, but these do not produce any signs if Nc is odd. When 
iVe is even, such hops generate a "wrong" sign for the Perron-Frobenius theorem. This "frustration" can be 
regarded as the origin of the spiral states [72, 73, 50, 65]. 
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( ^_ \(Afo-l)/2 / N (3) , 

a reference state $ = (5*^0^) (Hxii'^xt) *^vac, which obviously has S\J^ = 1/2 and 

'S'tot = 'S'inax- Clearly $ is written as a linear combination of basis states (2.10) with nonneg- 
ative coefficients. Thus we see ($, $gs) ¥" 0; which implies that S'tot = S'max also holds for 

$GS- ■ 

Once knowing that the ground states have S'tot = S'max, we can look at the ground state 
with S'tQt = S'max- Then it is apparent that the term (S^; ■ Sa;+i — n^n^+i/4) always vanishes, 
and hence the ground state is identical to that of -f^hop- Then we have the exact expression 
for the ground state 

$GS=( n ^T(V'^))<^vac, (8.2) 

\k s.t. |fc|<fcF J 

where rj^''^ is the plane wave state (2.5), and kp = n(Nc — 1)/(2L) is the fermi momentum. 
The ground state allows gapless (charge) excitations, and hence describes a metallic system. 

Unfortunately, the above theorem and proof do not shed any light on the corresponding 
problems in higher dimensions, which are much more important and interesting. The proof 
for the one-dimensional case relies heavily on the fact that essentially no nontrivial spin 
exchanges take place when U = oo and J = 0. The system suddenly becomes ferromagnetic 
when we turn on arbitrarily small J > 0. 

In the corresponding problem in higher dimensional lattices (or even on slightly more 
complicated one-dimensional lattices, like a ladder), there are highly nontrivial exchange 
processes even for J = 0. There is no guarantee that one gets ferromagnetism when intro- 
ducing an explicit ferromagnetic interaction, J > 0. We of course believe that the ferro- 
magnetic t-J models in higher dimensions also exhibit ferromagnetism for sufficiently large 
J/t and p = Nc/Ng, but we have no idea how one can prove such a statement. Although 
"proving ferromagnetism in the ferromagnetic t-J models" might sound like a tautology at 
first glance, it is indeed a very difficult and deep problem, whose solution should shed light 
on various aspects of strongly interacting itinerant electron systems. 

8.3 Hubbard model with correlated hopping 

In this section, we discuss an artificial model which (like the Hubbard model, and unlike the 
t-J model) have no interactions explicitly favoring ferromagnetism, but in which we can still 
prove the appearance of (probably metallic) ferromagnetism in a wide range of parameters 
and electron density. 

Take the one-dimensional lattice A = {l,...,As} with even Ng, and impose periodic 
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boundary conditions by identifying Ng + x with x. We consider the Hamiltonian 

N, Ns/2 N, 

H = -tY^ 5^(4,aCx+i,a + h.c.)+t'^ ^{clj^^C2j+2,an2j+i+h.c.) + U^n^^^n^^l, (8.3) 

x=l o-=t,i i = l <J=hl 2:=1 

where we set t > 0, t' > 0. We will take the U ^ oo limit. 

This resembles the Hubbard model with next nearest neighbor hopping, but the hopping 
term with t' has an extra number operator n2j+i = n2j+i,| + n2j+i^i. Thus a hop between 
2j and 2j + 2 is possible only when 2j + 1 is occupied by an electron. Note, however, that 
such density- dependent hopping terms themselves are completely spin-independent, and do 
not prefer ferromagnetism. Nevertheless we have the following. 

Theorem 8.2 (Ferromagnetism in the model with correlated hopping) Consider the 
model (8.3) in the limit U ^ oo with any odd electron number N^. Then the ground 
states have total spin Stot = 5'max(= Nc/2), and are non- degenerate apart from the triv- 
ial (2S'max + I) -fold degeneracy. 

Proof: Exactly as in the proof of Theorem 8.1, we take account of the U ^ oo limit by using 
the basis states (2.10) with xi < 0:2 < • • ■ < x^^ and aj =t, i such that S^J = X]i=i ^i — V2- 
It is again trivial to check that the nearest-neighbor hopping terms in (8.3) have nonpositive 
matrix elements. By definition, an electron hops between even sites x and x + 2 only when 
X + 1 is occupied. Then, such a hop is always associated with a change of ordering in 
electrons. (See Figure 16.) This yields a minus sign, resulting again in nonpositive matrix 
elements. 

It remains to prove that all the basis states (with S^^l = 1/2) are connected with each 
other by nonvanishing matrix elements. However, this is already proved in the one hole 
case in Section 4.3, and extensions to the multiple holes are trivial. Thus we can apply the 
Perron- Frobenius theorem exactly as in the proof of Theorem 8.1. ■ 

We believe that the ferromagnetic ground states of the present model have a metallic 
character. But this is not as obvious as in the case of the t-J model. The model remains 
strongly interacting in the sector with S^^l = S'max, and it is not easy to investigate the 
properties of the ground states. 

It is clear that the above theorem extends to much more general models with additional 
density-density interactions and on-site potential provided that there are nearest neighbor 
and next nearest neighbor (possibly site-dependent) hoppings with the correct signs. An 
interesting extension is obtained by replacing the second sum in (8.3) over even sites with a 
sum over all the sites as 

N, N, N, 

Hi = -t^Y^ (4,aCa;+i,a + h.c.) +t'YYl i(i,aCx+2,an^+i + h.c.) + f/ ^ n^^n^d' (8-4) 

X = l 0-=T,i 31 = 1 0-=T,i X = l 
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This defines a translation invariant model which resembles the t-t' model. Of course the 
same proof as above shows that the model exhibits ferromagnetism for U = oo. 

To see the relation between the model (8.4) and the t-t' model, consider the Hamiltonian 

H2 = Hi + t" J2 Yl {4,aC.+2,<x(l - n,+i) + h.c.}. (8.5) 

a;=i (7=1,1 

When t" = t', this becomes the Hamiltonian of the t-t' model, while it defines a controllable 
model (8.4) when t" = 0. One finds that the t"-term in (8.5) (along with other terms) 
generates an antiferromagnetic interaction. Therefore in order to control the possible ferro- 
magnetism in the t-t' model, one has to deal with the competition between the t'-term and 
the t"-term in (8.5), which seems to be a nontrivial task^^. 

8.4 Limiting U-V model 

We consider another toy model (which perhaps is slightly less artificial than the one in the 
previous section) obtained by adding a strong nearest-neighbor repulsion to the Hubbard 
model. Again we can prove the existence of (metallic) ferromagnetism easily. 

Take the one-dimensional lattice A = {l,...,A''s} with even Ng, and impose periodic 
boundary conditions. We consider a model with electron number N^ in the range Ng > Ne > 
Ns/2, and with the Hamiltonian 

Ns Ns/2 

H = -t^Yl d(rCx+l,a + h.c.) +t'YYl (4i,aC2i+2,<x + h.C.) 

+U J] n,,tn,,i + V^ H X^ n^n^+i J - {2N, - N,)\ , (8.6) 

where we require t > and t' > 0. We will consider the U,V ^ oo limits. The new nearest 
neighbor repulsion term is normalized so that the minimum energy for this term is zero. 
Note that the hopping amplitudes in the above model can be gauge transformed^^ to satisfy 
the condition for Nagaoka's theorem. 

As in the Hubbard model, none of the terms in (8.6) explicitly favor ferromagnetism. 
Nevertheless we can prove that they together generate ferromagnetism. 

Theorem 8.3 (Ferromagnetism in the limiting U-V model) Consider the model (8.6) 

in the limits U ^ oo and V ^ oo with odd Nc such that Ng > Nc > Ns/2. Then the ground 

^■^One does not face this competition in first order perturbation theory [66, 67]. 
^^Take A' in Appendix A as the set of even sites. 
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Figure 16: The essential electron exchange process in the correlated hopping model and the 
limiting U-V model. In the limiting U-V model with N^ > Ns/2, sites x — 1 and a; + 1 
must be occupied when x is empty. Then the ordering of electrons always changes when 
an electron hops from a; + 2 to x. In the model with correlated hopping, the same fact is 
guaranteed by the definition. 

states have total spin Stot = 'S'inax(= ^c/2), and are non- degenerate apart from the trivial 
(25'max + I) -fold degeneracy. 



Proof: We take into account the K -^ oo limit exactly as in Lemma 4.2. Now our basis states 
are (2.10) with a;i < X2 < ■ ■ ■ < xn^ and ctj =1, |, where Xi, . . . , x^^ must further satisfy the 
condition that the nearest neighbor repulsion term in (8.6) is minimized. When A^e > ^s/2, 
this condition implies that there can be no neighboring holes in the relevant basis states. 
This guarantees that an electron hops between even sites x and x + 2 only when a; + 1 is 
occupied. (See Figure 16.) The remainder of the proof is the same as that of Theorem 8.2. ■ 

Again, this result can be extended to various more complicated models. 

Note that the nature of the model is quite different for electron numbers N^ < Ns/2 
(in which case we drop — (2iVe — A^s) from the Hamiltonian (8.6)). In the limits U ^ oo 
and V ^ oo, configurations with neighboring electrons are inhibited, and hence exchange 
processes never take place. In this case the model should exhibit paramagnetism. This is 
consistent with a numerical work [74] where a shift in the ferromagnetic region as a result 
of finite V is reported for A"e ^ Ns/2. We expect that a finite value of V enlarges the region 
of ferromagnetism for A"s/2 < Nc < Ns. 

A Gauge transformation 

The signs of the hopping amplitude t^^y can be partially changed by means of gauge transfor- 
mations. Let A' be an arbitrary subset of A. We define the new operators Cx,cr by Cx,a = ~Cx,u 
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if a; G A' and Cx,a = Cx,a if a; ^ A'. Since Cx^a also satisfy the canonical anticommutation rela- 
tions (2.6) and (2.7), we can use these operators to describe the system. Then the hopping 
Hamiltonian (2.26) is written as 

-"hop — / ^ '^x,y^x,a^y^'yi \'^-^) 

x,y&A. 

with t[^y = tx,y if both x and y are in A' or if neither x nor y is in A', and t[^y = —tx,y if 
exactly one of x and y is in A'. 

The model is said to be bipartite when there is a subset A' with the property that tx^y = 
if both X and y are in A' or if neither x nor y is in A'. In a bipartite system, we can use 
the above gauge transformation to change the signs of all the hopping amplitudes. A typical 
example is the model on the simple cubic lattice with only nearest neighbor hoppings. 

B Hole-particle transformation 

Let us discuss a simple and standard transformation which maps a Hubbard model onto a 
different Hubbard model with (usually) different electron number. 

We define the new operators Cx^a by Cx^a = Cxa- This simply means that we switch the 
creation and the annihilation operators. Since the Cx^a also satisfy the canonical anticom- 
mutation relations (2.6) and (2.7), we can use these operators to describe the system. From 
(2.6), the new number operator hx^a = cl^Cx^a is related to the original number operator 
by Ux^cr = 1 — nx,a- Therefore the new total electron number A'"e (which is an eigenvalue of 
J2xeA o-=t I ^x,a) is related to the original electron number by A^c = 2A^s — A^e- 

The hopping Hamiltonian (2.26) is written as 

-"hop = / ^ \''x,y)Cx^a'^y,a + 2 ^ ^ tx,xi l-t^.ij 

x,yGA xSA 

and the interaction Hamiltonian (2.31) as 

-f^int = f/^(l -n^,T)(l -n^,i) 
xeA 



U Y^ fix,^hx,i -UY^ fix,a + NJJ. (B.2) 



a:eA xgA 

Note that (B.2) has exactly the same form as the original interaction Hamiltonian (2.31) 
apart from the constant term and the term proportional to the total electron number, which 
simply shift the total energy. 
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C Positive semidefinite operators 

We summarize the definition and elementary properties of positive semidefinite operators 
(or matrices) which we used in the main body of the paper. They should be well-known to 
readers with a mathematical background. 

Let ?i be a finite dimensional Hilbert space with inner product (■,■). In the paper, Ti 
may be TIn^ or [). 

Definition C.l (Positive semidefiniteness) For an operator (or matrix) on Ti, we write 
A > and say A is positive semidefinite if A is self-adjoint (or hermitian) and we have 
($, A$) > for any $ G 7i. For two self-adjoint operators A and B, we write A > B if 
A- B >0. 

The following statement is easily proved by diagonalizing A. 

Lemma C.2 (Positive semidefiniteness and eigenvalues) A self-adjoint operator (or 
a hermitian matrix) A is positive semidefinite if and only if all the eigenvalues of A are 
nonnegative. 

The following lemma provides a standard way of constructing a positive semidefinite 
operator. 

Lemma C.3 Let B be an arbitrary operator (or matrix) on Ti. Then A = B^B is positive 
semidefinite. 

Proof: Observe that for any $, we have ($,_B'^_B$) = (5$, 5$) > 0. ■ 

Conversely, any A > can be expressed as A = 5^ with some B >Q. This is most easily 
verified by diagonalizing A. 

The following lemma is used repeatedly in the main body of the paper. 

Lemma C.4 (The sum of positive semidefinite operators) //A > and B > 0, we 

have A + B >0. 



Proof: Assume that A + B has an eigenstate $ with a negative eigenvalue. Then we get 
> ($, {A + -B)$) = ($, A$) + ($, 5$) > 0, which is a contradiction. ■ 

The following is trivial if we expand $ into eigenstates of A. 

Lemma C.5 Let A > 0. Then ($, A<l>) = zs equivalent to A$ = 0. 
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The following lemma is also useful. 

Lemma C.6 Let Ai > for i = 1, . . . ,n. Then Y17=i ^i^ ~ ^ irnplies Aj$ = for each 
i = 1, . . . ,n. 

Proof: Since EILi ^i"^ = 0, we have = ($,Er=i^^'^) = EILi {'^,A^)- By noting that 
($, Aj$) > 0, this means ($, /lj$) = for each i = 1, . . . ,n. We then use Lemma C.5. ■ 

We finally state the following lemma, which sometimes provides us with a powerful 
information. 

Lemma C.7 Assume that A > is expressed as A = B^ B where B is not necessarily 
self- adjoint. Then A$ = implies 5$ = 0. 

Proof: Since B^'B^ = 0, we have = ($, B^B^) = (5$, 5$) which means 5$ = 0. ■ 



D Explicit construction of the Hilbert space and fermion 
operators 

In section 2.2, we first introduced the algebra of fermion operators, and then defined the 
Hilbert space by operating (a representation of) the algebra onto a single state $vac- There 
is no problem in making the discussion mathematically rigorous since both the algebra and 
the Hilbert space are finite dimensional. However, readers familiar with fields like functional 
analysis might feel more comfortable if the Hilbert space is first defined explicitly and then 
the operators are defined. Here we explicitly define the Hilbert space and the fermion 
operators following the standard approach^^. 

Let A = A X {TjI} be the configuration space for a single electron. We denote its 
elements as^^ u,Ui,U2,- ■ ■ G A. The Hilbert space for a single electron is £^(A;C). For 
n = 0, 1, . . . , 2A^s, we define the n-electron Hilbert space 7i„ by Tio = C, and 

nn = P-i\A;C)^---^i\A;C), (D.l) 



^^For more details, see, for example, Section 5.2 of [75]. 
^"^Of course u — {x, a) with x E A and a =t. |. 
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where P_ is the projection operator onto functions which are antisymmetric under exchanges 
of any two variables. More precisely, it is defined as 

'^^ (-l)P?/'(Mp(l),...,Mp(„)), 



P_V'(mi, •••,«„) 



n\ 



E 



(D.2) 



p:(l,...,n)^(p(l),...,p(n)) 

where p is summed over all permutations of (1, . . . , n), and (—1)^ denotes the parity of p. 

We now define the fermion operators. For ip G TYq, we let Cuip = 0. For ip G 7Y„ with 
n > 1, we let 

{Cu1p){Ui, ..., Un^l) = y/nij{u, Ml, ... , Un-l), (D.3) 

where Cuip G 'Hn-i- On i' ^ T^n with n < 2Ns, the adjoint operator c]^ acts as 

-, n+l 

(cI^)(Mi, • • • , M„+l) = ^ — - y'(-l)-'+^(5„,„ V^(Mi, . . . , Mj-1, Mj+i, . . . , M„+i), (D.4) 

Vn + 1 jrt 

where cl^ G 7i„+i. For ■?/' G 7Y2Afs, we let c]^-?/' = 0. 

It is natural to regard the operators c„ : Tin -^ 'Hn-i and c\ : 7i„ ^ 7Y„+i as acting on 
the Fock space 



2N, 



J — (1) rin- 



(D.5) 



ra=0 



Finally, we take a basis state (say 1) of the Hilbert space TYq = C, and identify it with 
$vac- Then it is not difficult to check that states of the form (2.10) form a basis of the Hilbert 
space 'Hn^- 

As examples, let us calculate / = (c]^<l>vac) G 'H\ and g = (cj^,/) = (cj^,cj<l>vac) £ ^^2, where 
we understand that <l>vac = 1- From (D.4), we have f{ui) = (cJ,l)('Ui) = 6y^ui- Similarly we 
get 



9[Ul,U2) 



{clf){ui,U2) 

-7= {5w,uJiu2) - (5^,«2/(Mi)} 

1 



V2 



\^W,Ul^V,U2 ^W,U2^V,Ul ) • 



(D.6) 



Note that g is antisymmetric under the exchange of the names of the two particles. This is 
a special case of more general states 



(4i ■■ -Cl^ "^ vac) (Mi, ••-,««) 



1 



"1)1 ,M2 ^V2,U2 ' ' ' ^Vn,U2 



'Jvi,Un '^V2,Un ' ' ' '~^Vn,Un 



(D.7) 



which are known as the Slater determinant states. 
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E Band structures in single-electron energy spectra 

Here we explain (for readers without background in condensed matter physics) the notion of 
band structure in a single-electron problem. The readers will find the notion quite elemen- 
tary^^, especially in a tight-binding model. 

We realize the lattice A as a subset of the d- dimensional L x ■ ■ ■ x L torus 



TL = {{xi,...,Xd) 0<a;, <L|=M7~ (E.l) 

where the identification ~ is defined by a; ~ y <^==^ x — yE UL^ . Here L is a positive integer. 
Let the unit cell U C [0, 1)'^ be a set of finite points (sites). The number of sites \U\ in the 
unit cell will determine the number of bands. Assume that the lattice A can be written as 



A 



ix^z a; G W, z G {0, 1, . . . , L - 1}'^} . (E.2) 



We then assume that the hopping matrix T = {tx^y)x,y£A has translational invariance^^ 

for any x,y E A and z G 7^'^. We note that perfect translational invariance is never possible 
in real physical systems which always have boundaries™. We often take artificial periodic 
boundary conditions because we want to concentrate on universal behavior taking place in 
the bulk of the system. 

We are interested in the eigenvalue problem ecp = Tcp or 

j/GA 

for a single electron. The translational invariance (E.3) suggests that we look for eigenstates 
in the form 

^x = e'^'X(x), (E.5) 

where /c G /C is a wave vector, u{x) is the unique element in U such that x — u{x) G Z'^, 
and V = {vu)ueu is a |W [-dimensional vector. The set of wave vectors /C consists oi k = 

^^In the band theory of electrons in soUds, one usually talks about band structures in an "effective single- 
electron problem" (defined through complicated self-consistency arguments) in interacting many-electron 
problems. To explain this theory is beyond the scope of the present article (and the ability of the author). 

^®We of course identify x + z and y + z as elements in Tl by using the identification ^ if necessary. 

^°The presence of boundaries does not change the band structure drastically, but introduces extra eigen- 
states which mainly live on the boundaries. Analysis of single-particle eigenstates may not be easy. 
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Figure 17: The dispersion relations and the corresponding band structure of the simple one- 
dimensional model with next-nearest-neighbor hoppings with t = t'/2 > and V = 0. The 
spectrum consists of the two bands separated by a finite band gap. 



((27r/L)rai, . . . , {2ti / L)nj) with rij = 0, ±1, . . . , ±{(L/2) — 1}, L/2 (where we assume L to be 
even). By substituting (E.5) into (E.4), we get 



E(fe) 



with 



T, 



'GW 



(fe) _ v^ JHv-u) 

u,u' ~ / , ^u,y c 

j/GAs.t. u{y)=u' 



(E.6) 



(E.7) 



For each k & }C, (E.6) has \U\ eigenvalues that we denote as ej{k) with j = 1, . . . ,\U\. We can 
choose ej{k) so that ej{k) is an analytic function oi k E {—7i,ttY for each j. Consequently, 
the spectrum of the hopping matrix T is decomposed as 



\u\ 






(E.8) 



with o"j(T) = <ej{k) k E K,?. Each crj(T) is called an energy band. The function ej{k) is 

called the dispersion relation of the j-th band. 

Let us discuss a simple example with two bands. Let d = 1, and take U = {0, 1/2}. 
The resulting lattice is A = {0, 1/2, 1, 3/2, . . . ,L — (1/2)}. We define the hopping matrix 
T = (t^,y)a;,yeA by t^^y = t' if \x -y\ = 1/2, t^^y = tiix,yEZ and \x - y\ = 1, t^,^ = V ii 
X + 1/2 G Z, and tx,y = otherwise. Then the eigenvalue equation (E.6) becomes 



^^0 



2tcosA; 2t' cos(A;/2) \ / vq 
2t'cos{k/2) V J Vt;i/2 



(E.9) 
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The two eigenvalues of (E.9) define two dispersion relations 

£1 2(A;) = ^iv + 2tcoskT Vi^ - 2tcos A;)^ + (4t'cos(A;/2))2| . (E.IO) 

See Figure 17. 

Consider special cases where the parameters t' and V can be written as t' = At and 
V = (A^ — 2)t with the parameter A > 0. Then (E.IO) becomes ei{k) = —2t and e2{k) = 
\^t + 2tcosk. The model becomes the flat-band modeF^ discussed in Section 6.3. 

F Proof of Theorem 3.3 

We describe estimates required for the proof of Theorem 3.3. Although the calculations 
are straightforward, they illustrate some standard techniques we encounter in many-fermion 
problems and may serve as a good exercise for beginners. 

First note that the sate \E' defined in (3.6) satisfies fix^iny^i"^ = for any x ^ y since 
there is only one | electron. Then the definition (3.9) of the variational state \1' simplifies as 

$ = Po$ = (1 - z>)^, (F.l) 

with z> = XlxeA^a;,T^^,i- To evaluate the energy expectation value of (3.11), we observe that 

(^^, ^) = (Po^, Po^) = (^, Po^) = 1 - (^, 0^) , (F.2) 

and 



^,H^op^j = (^,(l-Z>)/Jhop(l-^>)^) 

= E^(l-2(^,z>^)) + (^,z>i7hop^>^), (F.3) 

where we used i/hop^ = E^"^ and (\E', \E') = 1. Thus we need to estimate the quantities 
(^,z>^) and (^,z>i7hop/>^). 

The best way to evaluate these quantities is to go into a Fourier representation and 
express everything in terms of the aj^a operators of (2.27). By using the completeness 
relation ^jlii'ipji Yw = 6x,x', we find the inverse of (2.27) is 

c.,. = 5^^i^^,,.. (F.4) 



^^We recover the dispersion relation at the end of Seetion 6.3 if we shift the energy by 2t. 
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This leads us to the representations 

p,q,r,s=l 

with 

V,,,,r,s = J2('^^'^y'^i'\^^:^M'\ (F.6) 

and 

All we have to do now is substitute (F.5) and (F.7) into the desired quantities, and use the 
anticommutation relations {aj^o-,afc,r} = 0, {a]^o-, a^.r} = Sj,kSa,T, the definition (3.6) of \E', 
and the fact that 0^,0-'^ vac = 0. Then we obtain 

Afe-l 

,1,1 



p=l 



p=i 

A^c-l 



xeA \ p=i 

/A^e-l \ / A^e-l 

V j = l / x€A \ p=l 

xeA \ p=i / 

and 

{/iVo-l \ /Nc-l 

\ p=i / \ <j=i 

2;,j/eA I V p=l / \g=Afo / 

xeA [ \p=i / J 
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with 

a;,i/GA I V p=l / V g=l 



E ^-. E (^^'y^f E ^i^^(v^i^^)* (v^i'^)>i'^ ' (F-ii; 



x,yeA {. \ p=l / \ q=l 

s.t.xj^y 



where we have used the fact that X]«ii ^p(w )*V^^ — ^x,y and X]p=i(V^^ )*W — ^x,y We 
also made the obvious decomposition ifhop = HJ + H}. 
By collecting all the estimates, we get 



^-^ = /^ ^\ -Eferro 



with 



^ ^<maxt^^. (F.13) 



E.eA(l^^'¥E;:7M4' 



To proceed further, we assume that there is a constant a > independent of x, Ns and 

p = 1, . . . , Ng, and we have'^^ 

|^(P)| < ^. (F.14) 



Note that if the model possesses translational invariance as in Appendix E, then the bound is 
true with a = v|W|. We also assume that max^gAt^^a,, max2;eA X]«eA l^^,j/l' ^^^ ^i converge 
to finite quantities as iVg — ;► oo, which is again trivially valid in translation invariant models. 
With the requirement (F.14), we obtain \R\ < const. p^, which means that R gives only 
negligible contributions for small p. Since we also find (\E', z>\E') oc p, the bound (F.12) 
implies the desired estimate (3.11). 

It is a pleasure to thank Izumi Hirabayashi and Koichi Kusakabe for valuable discussions 
related to the material in Section 7.1, and Shun-Qing Shen for valuable discussions about the 
t-t' model. I also thank Fumihiko Nakano and Yuuki Watanabe for various useful comments 
on the manuscript. 



^^This assumption rules out the possibility of localized eigenstates. 
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